1 ATGAGCCAGC CCAGGCCCCG CTACGTGGTA GACAGAGCCG CATACTCCCT
51 TACCCTCTTC GACGATGAGT TTGAGAAGAA GGACCGGACA TACCCAGTGG 
101 GAGAGAAACT TCGCAATGCC TTCAGATGTT CCTCAGCCAA GATCAAAGCT 
151 GTGGTGTTTG GGCTGCTGCC TGTGCTCTCC TGGCTCCCCA AGTACAAGAT 
201 TAAAGACTAC ATCATTCCTG ACCTGCTCGG TGGACTCAGC GGGGGATCCA 
251 TCCAGGTCCC ACAAGGCATG GCATTTGCTC TGCTGGCCAA CCTTCCTGCA 
301 GTCAATGGCC TCTACTCCTC CTTCTTCCCC CTCCTGACCT ACTTCTTCCT 
351 GGGGGGTGTT CACCAGATGG TGCCAGGTAC CTTTGCCGTT ATCAGCATCC 
401 TGGTGGGTAA CATCTGTCTG CAGCTGGCCC CAGAGTCGAA ATTCCAGGTC 
451 TTCAACAATG CCACCAATGA GAGCTATGTG GACACAGCAG CCATGGAGGC 
501 TGAGAGGCTG CACGTGTCAG CTACGCTAGC CTGCCTCACC GCCATCATCC 
551 AGATGGGTCT GGGCTTCATG CAGTTTGGCT TTGTGGCCAT CTACCTCTCC 
601 GAGTCCTTCA TCCGGGGCTT CATGACGGCC GCCGGCCTGC AGATCCTGAT 
651 TTCGGTGCTC AAGTACATCT TCGGACTGAC CATCCCCTCC TACACAGGCC 
701 CAGGGTCCAT CGTCTTTACC TTCATTGACA TTTGCAAAAA CCTCCCCCAC 
751 ACCAACATCG CCTCGCTCAT CTTCGCTCTC ATCAGCGGTG CCTTCCTGGT 
801 GCTGGTGAAG GAGCTCAATG CTCGCTACAT GCACAAGATT CGCTTCCCCA 
851 TCCCTACAGA GATGATTGTG GTGGTGGTGG CAACAGCTAT CTCCGGGGGC 
901 TGTAAGATGC CCAAAAAGTA TCACATGCAG ATCGTGGGAG AAATCCAACG 
951 CGGGTTCCCC ACCCCGGTGT CGCCTGTGGT CTCACAGTGG AAGGACATGA 
1001 TAGGCACAGC CTTCTCCCTA GCCATCGTGA GCTACGTCAT CAACCTGGCT 
1051 ATGGGCCGGA CCCTGGCCAA CAAGCACGGC TACGACGTGG ATTCGAACCA 
1101 GGAGATGATC GCTCTCGGCT GCAGCAACTT CTTTGGCTCC TTCTTTAAAA 
1151 TTCATGTCAT TTGCTGTGCG CTTTCTGTCA CTCTGGCTGT GGATGGAGCT 
1201 GGAGGAAAAT CCCAGGTGGC CAGCCTGTGT GTGTCTCTGG TGGTGATGAT 
1251 CACCATGCTG GTCCTGGGGA TCTATCTGTA TCCTCTCCCT AAGTCTGTGC 
1301 TAGGAGCCCT GATCGCTGTC AATCTCAAGA ACTCCCTCAA GCAACTCACC 
1351 GACCCCTACT ACCTGTGGAG GAAGAGCAAG CTGGACTGTT GCATCTGGGT 
1401 AGTGAGCTTC CTCTCCTCCT TCTTCCTCAG CCTGCCCTAT GGTGTGGCAG 
1451 TGGGTGTCGC CTTCTCCGTC CTGGTCGTGG TCTTCCAGAC TCAGTTTCGA 
1501 AATGGCTATG CACTGGCCCA GGTCATGGAC ACTGACATTT ATGTGAATCC 
1551 CAAGACCTAT AATAGGGCCC AGGATATCCA GGGGATTAAA ATCATCACGT 
1601 ACTGCTCCCC TCTCTACTTT GCCAACTCAG AGATCTTCAG GCAAAAGGTC 
1651 ATCGCCAAGA CTGTCTCCCT GCAGGAGCTG CAGCAGGACT TTGAGAATGC 
1701 GCCCCCCACC GACCCCAACA ACAACCAGAC CCCGGCTAAC GGCACCAGCG 
1751 TGTCCTATAT CACCTTCAGC CCTGACAGCT CCTCACCTGC CCAGAGTGAG 
1801 CCACCAGCCT CCGCTGAGGC CCCCGGCGAG CCCAGTGACA TGCTGGCCAG 
1851 CGTCCCACCC TTCGTCACCT TCCACACCCT CATCCTGGAC ATGAGTGGAG 
1901 TCAGCTTCGT GGACTTGATG GGCATCAAGG CCCTGGCCAA GCTGAGCTCC 
1951 ACCTATGGGA AGATCGGCGT GAAGGTCTTC TTGGTGAACA TCCATGCCCA 
2001 GGTGTACAAT GACATTAGCC ATGGAGGCGT CTTTGAGGAT GGGAGTCTAG 
2051 AATGCAAGCA CGTCTTTCCC AGCATACATG ACGCAGTCCT CTTTGCCCAG 
2101 GCAAATGCTA GAGACGTGAC CCCAGGACAC AACTTCCAAG GGGCTCCAGG 
2151 GGATGCTGAG CTCTCCTTGT ACGACTCAGA GGAGGACATT CGCAGCTACT 
2201 GGGACTTAGA GCAGGAGATG TTCGGGAGCA TGTTTCACGC AGAGACCCTG 
2251 ACCGCCCTGT GA (SEQ ID NO:1)

FEATURES: 
Start Codon: 1 
Stop Codon: 2260 
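
The start and stop positions listed above can be cross-checked by translating the cDNA in frame. The following is a minimal Python sketch, assuming the standard genetic code; `translate` is an illustrative helper rather than part of any tool named in this listing, and only the first 30 and the last 12 nucleotides of SEQ ID NO:1 are used.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string; '*' marks a stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


first_30 = "ATGAGCCAGC CCAGGCCCCG CTACGTGGTA"  # nucleotides 1-30
last_12 = "ACCGCCCTGT GA"                      # nucleotides 2251-2262

print(translate(first_30))  # → MSQPRPRYVV
print(translate(last_12))   # → TAL*

# A stop codon starting at nucleotide 2260 leaves (2260 - 1) / 3 sense codons.
print((2260 - 1) // 3)      # → 753
```

The 753 sense codons agree with the 753-residue protein listed as SEQ ID NO:2.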
HOMOLOGOUS PROTEINS:

TOP BLAST Hits: 

gb|AAF81911.1|AF279265_1 (AF279265) putative anion transpor...
gb|AAF71715.1|AF230376_1 (AF230376) prestin [Meriones ungui...
ref|NP_000432.1| pendrin [Homo sapiens] >gi|11421915|ref|XP...
ref|NP_035997.1| Pendred syndrome homolog (human); Pendred...
ref|NP_062087.1| Pendred syndrome homolog (human) [Rattus n...
ref|NP_067328.1| down-regulated in adenoma [Mus musculus] >...
ref|NP_000102.1| down-regulated in adenoma protein [Homo sa...
sp|O70531|DTD_RAT SULFATE TRANSPORTER (DIASTROPHIC DYSPLASI...
ref|NP_000103.1| sulfate anion transporter 1; Diastrophic d...
ref|NP_031911.1| diastrophic dysplasia [Mus musculus] >gi|2...
4e-97



BLAST to dbEST: 

gi|8630793 /dataset=dbest /taxon=960...

EXPRESSION INFORMATION FOR MODULATORY USE: 

library source: 

Expression information from BLAST dbEST hits:
gi|8630793 Human head-neck



Expression information from PCR-based tissue screening panels:
Human fetal lung 
1 MSQPRPRYVV DRAAYSLTLF DDEFEKKDRT YPVGEKLRNA FRCSSAKIKA
51 VVFGLLPVLS WLPKYKIKDY IIPDLLGGLS GGSIQVPQGM AFALLANLPA
101 VNGLYSSFFP LLTYFFLGGV HQMVPGTFAV ISILVGNICL QLAPESKFQV 
151 FNNATNESYV DTAAMEAERL HVSATLACLT AIIQMGLGFM QFGFVAIYLS 
201 ESFIRGFMTA AGLQILISVL KYIFGLTIPS YTGPGSIVFT FIDICKNLPH 
251 TNIASLIFAL ISGAFLVLVK ELNARYMHKI RFPIPTEMIV VVVATAISGG
301 CKMPKKYHMQ IVGEIQRGFP TPVSPVVSQW KDMIGTAFSL AIVSYVINLA
351 MGRTLANKHG YDVDSNQEMI ALGCSNFFGS FFKIHVICCA LSVTLAVDGA 
401 GGKSQVASLC VSLVVMITML VLGIYLYPLP KSVLGALIAV NLKNSLKQLT
451 DPYYLWRKSK LDCCIWVVSF LSSFFLSLPY GVAVGVAFSV LVVVFQTQFR
501 NGYALAQVMD TDIYVNPKTY NRAQDIQGIK IITYCSPLYF ANSEIFRQKV 
551 IAKTVSLQEL QQDFENAPPT DPNNNQTPAN GTSVSYITFS PDSSSPAQSE
601 PPASAEAPGE PSDMLASVPP FVTFHTLILD MSGVSFVDLM GIKALAKLSS 
651 TYGKIGVKVF LVNIHAQVYN DISHGGVFED GSLECKHVFP SIHDAVLFAQ 
701 ANARDVTPGH NFQGAPGDAE LSLYDSEEDI RSYWDLEQEM FGSMFHAETL 
751 TAL (SEQ ID NO: 2) 
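
Numbered rows of this kind carry a built-in consistency check: each full row holds five groups of ten residues, so a row's starting index plus its residue count should equal the next row's starting index. A minimal sketch of such a check follows; `check_rows` is a hypothetical helper, the three rows are copied from the listing above, and the damaged row is constructed purely for illustration.

```python
def check_rows(rows):
    """Return start indices of rows whose length disagrees with the next row's index."""
    parsed = []
    for row in rows:
        index, *groups = row.split()
        parsed.append((int(index), "".join(groups)))
    bad = []
    for (start, seq), (next_start, _) in zip(parsed, parsed[1:]):
        if start + len(seq) != next_start:
            bad.append(start)
    return bad


rows = [
    "101 VNGLYSSFFP LLTYFFLGGV HQMVPGTFAV ISILVGNICL QLAPESKFQV",
    "151 FNNATNESYV DTAAMEAERL HVSATLACLT AIIQMGLGFM QFGFVAIYLS",
    "201 ESFIRGFMTA AGLQILISVL KYIFGLTIPS YTGPGSIVFT FIDICKNLPH",
]
print(check_rows(rows))     # → []

# Illustration only: drop one residue from the second row to mimic a damaged row.
damaged = [rows[0], rows[1].replace("FNNATNESYV", "FNNATNESY"), rows[2]]
print(check_rows(damaged))  # → [151]
```

The last row of a sequence cannot be checked this way, since no following index exists.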

FEATURES: 

Functional domains and key regions: 

[1] PDOC00001 PS00001 ASN_GLYCOSYLATION

N-glycosylation site 

Number of matches: 3 

1 153-156 NATN 

2 156-159 NESY 

3 580-583 NGTS 
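
The three matches above can be reproduced with a simple pattern scan. This is a sketch only, assuming the PS00001 consensus N-{P}-[ST]-{P} written as a regular expression; `find_sites` is an illustrative helper, and the two fragments are residues 151-160 and 571-590 of SEQ ID NO:2.

```python
import re

# PS00001 consensus N-{P}-[ST]-{P} as a regex (assumed form of the pattern).
# The lookahead lets overlapping sites, such as NATN and NESY, both be reported.
N_GLYC = re.compile(r"(?=(N[^P][ST][^P]))")


def find_sites(fragment, first_residue):
    """Return (start, end, motif) tuples using 1-based positions in the full protein."""
    return [
        (m.start() + first_residue, m.start() + first_residue + 3, m.group(1))
        for m in N_GLYC.finditer(fragment)
    ]


print(find_sites("FNNATNESYV", 151))
# → [(153, 156, 'NATN'), (156, 159, 'NESY')]
print(find_sites("DPNNNQTPANGTSVSYITFS", 571))
# → [(580, 583, 'NGTS')]
```

The candidate NQTP at 575-578 is rejected because proline occupies the final position.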

[2] PDOC00005 PS00005 PKC_PHOSPHO_SITE
Protein kinase C phosphorylation site 

Number of matches: 2 

1 45-47 SAK 

2 445-447 SLK 

[3] PDOC00006 PS00006 CK2_PHOSPHO_SITE
Casein kinase II phosphorylation site
[4] PDOC00007 PS00007 TYR_PHOSPHO_SITE
Tyrosine kinase phosphorylation site 

Number of matches: 2 

1 7-15 RYVVDRAAY

2 447-454 KQLTDPYY 
[5] PDOC00008 PS00008 MYRISTYL

N-myristoylation site 
Number of matches: 10 
Membrane spanning structure and domains:
BLAST Alignment to Top Hit:

>gb|AAF81911.1|AF279265_1 (AF279265) putative anion transporter 1 [Homo
sapiens] 

Length = 738 
Score = 476 bits (1224), Expect = e-133 

Identities = 263/724 (36%), Positives = 428/724 (58%), Gaps = 36/724 (4%)
Frame = +3 

Query: 54 lfddefekkdr--typvgeklrnafrcssakikavvfgllpvlswlpkykikdyiipdll 227

L + EER+P +R +CS A+ A++ LPVL WLP+Y ++D+++ DLL 

sbjct: 15 lnqehleelgrwgsaprthqwrtwlqcsrarayalllqhlpvlvwlprypvrdwllgdll 74 
Query: 228 gglsggsiqvpqgmafallanlpavnglyssffplltyfflggvhqmvpgtfavisilvg 407 

GLS +Q+PQG+A+ALLA LP V GLYSSF+P+ YF G + GTFAV+S++VG 

sbjct: 75 sglsvaimqlpqglayallaglppvfglyssfypvfiyflfgtsrhisvgtfavmsvmvg 134 
Query: 408 niclqlapeskfqvfnnatnesyvdtaameaerlhvsatlacltaiiqmglgfmqfgfva 587 

++ LAP+ A N+S ++ A +A R+ V++TL+ L + Q+GLG + FGFV

sbjct: 135 SVTESLAPQ--------ALNDSMINETARDAARVQVASTLSVLVGLFQVGLGLIHFGFVV 186

Query: 588 IYLSESFIRGFMTAAGLQILISVLKYIFGLTIPSYTGPGSIVFTFIDICKNLPHTNIASL 767 

YLSE +RG+ TAA +Q+ +S LKY+FGL + S++GP S+++T +++C LP + + ++
sbjct: 187 TYLSEPLVRGYTTAAAVQVFVSQLKYVFGLHLSSHSGPLSLIYTVLEVCWKLPQSKVGTV 246 

Query: 768 IFALISGAFLVLVKELNARYMHKIRFPIPTEMIVVVVATAISGGCKMPKKYHMQIVGEIQ 947

+ A ++G LV+VK LN + ++ PIP E++ ++ AT IS G + ++ + +VG I
sbjct: 247 VTAAVAGVVLVVVKLLNDKLQQQLPMPIPGELLTLIGATGISYGMGLKHRFEVDVVGNIP 306

Query: 948 RGFPTPVSPVVSQWKDMIGTAFSLAIVSYVINLAMGRTLANKHGYDVDSNQEMIALGCSN 1127

G PV+P + ++G+AF++A+V + I +++G+ A +HGY VDSNQE++ALG SN
Sbjct: 307 AGLVPPVAPNTQLFSKLVGSAFTIAVVGFAIAISLGKIFALRHGYRVDSNQELVALGLSN 366

Query: 1128 FFGSFFKIHVICCALSVTLAVDGAGGKSQVASLCVSLVVMITMLVLGIYLYPLPKSVLGA 1307

G F+ + C++S +L + GG SQVA SL +++ ++ LG + LPK+VL A 
Sbjct: 367 LIGGIFQCFPVSCSMSRSLVQESTGGNSQVAGAISSLFILLIIVKLGELFHDLPKAVLAA 426 

Query: 1308 LIAVNLKNSLKQLTDPYYLWRKSKLDCCIWVVSFLSSFFLSLPYGVAVGVAFSVLVVVFQ 1487

+I VNLK L+QL+D LW+ ++ D IW+V+F ++ L+L G+ V V FS+L+VV +
Sbjct: 427 IIIVNLKGMLRQLSDMRSLWKANRADLLIWLVTFTATILLNLDLGLVVAVIFSLLLVVVR 486

Query: 1488 TQFRNGYALAQVMDTDIYVNPKTYNRAQDIQGIKIITYCSPLYFANSEIF 1637 

TQ + L QV DTDIY + Y+ A++++G+K+ + +YFAN+E + 
Sbjct: 487 TQMPHYSVLGQVPDTDIYRDVAEYSEAKEVRGVKVFRSSATVYFANAEFYSDALKQRCGV 546 

Query: 1638 rqkviak--tvslqelqqdfe-napptdpnnnqtpan-gtsvsyi 1760 

++K++ K + L++LQ++ + P N TS+ + 

Sbjct: 547 dvdflisqkkkllkkqeqlklkqlqkeeklrkqaaspkgasvsinvntsledmrsnnved 606 
Query: 1761 tfspdssspaqseppasaeapgepsdmlasvppfvtfhtlildmsgvsfvdlmgi 1925 

S D A + ++AP + S + A P FH+LILD+ +SFVD + + 

sbjct: 607 ckmmqvssgdkmedatangqedskap-dgstlkalglpqpdfhslildlgalsfvdtvcl 665 
Query: 1926 kalaklsstygkigvkvflvnihaqvyndishggvfedgsleckhvfpsihdavlfaqan 2105 

K+L + + +I V+V++ H+ V + + G F D S+ KH+F S+HDAV FA +

sbjct: 666 kslknifhdfreievevymaachspvvsqleaghff-dasitkkhlfasvhdavtfalqh 724
Query: 2106 ardv 2117 

R V 

sbjct: 725 PRPV 728 (SEQ ID NO:4)



Hmmer search results (Pfam): 

Model Description score E-value N 

PF00916 sulfate transporter family 254.5 1.5e-72 1 

PF00189 Ribosomal protein S3, C-terminal domain. 3.3 8 1 
Parsed for domains:

Model Domain seq-f seq-t hmm-f hmm-t score E-value

PF00916 1/1 187 497 1 328 [] 254.5 1.5e-72 

PF00189 1/1 651 661 79 89 .] 3.3 8 
1 CTGGGTTCCT ATGTGGGGAG GTCATGCTCC CCACTCATTG AGCCCCCCCA
51 GGCAAACCAC CTGGACAGCC AGACCCATGC AGACTCTGGA GCAGGTGGAG 
101 AGGAAGAGTG AGACCACCCC GCCTCACGGG CGGTGAAGGG CCGGCAGCCT 
151 CTGAATAGTC TCTGCTAGGA GGTAGAAAGC ACCCTCCCAT CTTAATCATA 
201 GTAATCATCG CCACTACCAT TTACTGGGTG CCTATAAAAG GCCAGCCTCT 
251 TCATACACAT GATCTCACTG AATCCTCATA GCATCTGCCT GCGACTGTTA 
301 TTATCCCCAT TTACAGATGA AGAAACTGAA TCTTTGAACC CAGGTCATCT 
351 GGCTCTCAAA CTTGTGCTGT TTTCCCTAAG CCACCCGGTC TCTCATTTCT 
401 CCCACTGAAA TGTCTCACAT GCCATTGCCC TTACTCATTT CTGCCCATGT 
451 CTCCTCCAAA ACACCATTTA TCAATTCGCT CAACAAGTAT GTGTTGAGTA
501 CACACTAAGG GCCAGGCGAG GGGCTGGGCA CAGGCGCTGG GGGTAGGTTC 
551 ATTCTCCCAC CTTCGCTTCT GCTGGGTATC ACCTGTGGGG TCTTGCCGGG 
601 CATCCCACCC TCACCTGTAG TTCAAGTGGA CCTTGGGATC CCAAGACCAA 
651 ATGAATGGAA TGCACCAGCC CAGCCTTCAC CAACTTGAGC ACAATCTTAT 
701 TCATAATAGA AACTCACATT TGCATCACAC TTTACATTTT ACACAACCCC 
751 TTCTTATCCA TTAACTCATT TGATCTTCAC AACAACCCTG TGAGATATGT 
801 CTGTTACTCC CACTTTAGTG ATACAGAATC TGAGGTTTGA AAAGTAATGC 
851 TGACCATTCT GCCTCATTAA TAAAAGCAGG ATTAACCCAG GCTCCTGGAC 
901 CCTTCCACAA AAGGCATTAA GCAACCTGCT CCCCTCTGAC AACCTCCCCT 
951 GTCACCCAGG CTCTCCTCTG GGAAGTTGGG GGCATCTCTA GCCCCCAAGT 
1001 AGTTACTCAT TTTCAACCCC ATCTCAAATC TTTTGCCAAA CTGGCCACAG 
1051 CCACCCCACA CTCCCCACCT CCCAGATACA AATCCTCACT CTAAGCCTTC 
1101 CCCATCTCTT TCTTCTCTGT COTCTTTCT CTGTGGTCCT CTGAGCAACT 
1151 TCTCCCAGCT CTGGGAGGTA GAGGGGAGGT GGGAGACCCA GTAATTGGAA 
1201 GAGGGAGGGG GAAAGGTTCC TACAGGGAAC TCCTCCGGGC CTCAGGGGCC 
1251 CTGGCACTCA GCTCTGCCCA TCTCAGCTCC TGGAACGTCA GCCAGGTTGC 
1301 GCAAAAAGTG AGGAGGAGAG GAGCGGCAGT ACACAAGGGT GGGGGAAAGA 
1351 TTAGGCACAG GAAGCCGTGG GAGAGAGAGC CGGCAGGTGG ACCATCCTGG 
1401 TTTCCCCACA CACACCATTG TCCCCCTGGG AAACCTGTTG GTGAAGTTCT 
1451 AGATGTCTTA TCCAAGAAGG GTCCTCTTGA GGTCATCTCA GCTATCCCCC 
1501 TGCCTCTAGG CAAGCTGTTT TCTGTTTCTT CCAAGCTGAC TGGCTGAATG 
1551 GTAGGAGCCT TTCTGCCAGG GAAACTAAGG TCTGGGAAGG GAGTATGGCT 
1601 TGTGGGGACA CCAGGGGTCA GGGGAGGGGA GGGTCCACCT GCTGAATCAA 
1651 GTGGGGCCTC CTGCCCTCGT GATTCCCCTT TGCCTGGTGC TCAGTGGGGG 
1701 TGATGGTGAC GCCACAGGTG TGGAGTGCCA GCCACGTGCT GAGCGCCAAG 
1751 CAAAACAGCC AGGGTGAGTC TATGCATCAT CAGTGCCTGG GAAGGAAGGC 
1801 CACTGCGAGC AGGGAGTCTG ACGGAAAAAC TTGACAGAGG GAAGGGAGGC 
1851 ACCTTGCTTT ATCGGGGCGG GGAAGGCCAG AATAAAACTC TGCTACTGCA 
1901 AGGACCAGAG AGAGAAGGCC TGGGCTGGCA CTAGGGAGGG ATGTTCCCTC 
1951 ACCCTCCCCT CCTCTGCTTC TCCCAAAGCT TGTAAATGCC CCAGATATGA 
2001 GCCAGCCCAG GCCCCGCTAC GTGGTAGACA GAGCCGCATA CTCCCTTACC 
2051 CTCTTCGACG ATGAGTTTGA GAAGAAGGAC CGGACATACC CAGTGGGAGA 
2101 GAAACTTCGC AATGCCTTCA GGTAACTGGT CCAGAGCCCA GACTTCTGCC 
2151 TCCTCTGCTC CCTACCAAAA TCCTTTCTGC ACCAGGACAC GGCTTCTGCA 
2201 CTGGTATCCC TAAGATGGGG TTAAGGGAAG CCCTGGGGAA GTGAGGTTCT 
2251 GAATGATGAA TTTAAGATCC TACAACCTCA TCTGTACTGA GACCCCCAGG 
2301 GAGGATGGGG AGCAGGAGCA AGAACCATCC AGAAGGGTTA TATGGCATTC 
2351 CCAAACCCCT GCATGGCATC TCCCATATTC TCAATTCACC CGGGTCTCTC 
2401 TGGGTTTGTT AAGGCATGGT AGATGAGCAT CTACGTTATG GAGGGGTGGG 
2451 GAGCATCAGA GCCCTTACTC CATGCCCTGT TCCCTCCTTA CAAAAAATAC 
2501 CTGAAGTTAC CATCACCCCA GGTTCTTTGT CCTTTCCCTC CCGGATGTTC 
2551 CTTCCTCCAC TTGGTCCAGA GAATGCCAAA AGGAGGCCCT AAATTTCTGA 
2601 ACTTTCCTGA GGGGACCTAC CAGGGTGTAG TCCTACCAGC GCCCAGGGTC 
2651 TTTCCACTCT CATCTCCCTG GAAATGCGAT GGTGGGTATG AAACCTTGTC 
2701 CCTAAGTAGG CGCTACACAA GGTGATCCAT ACCCACACCC CAGGAGGCTG 
2751 GGGCTGCGGG TGTCACCCTC CCCATTCCCA GACTCCTGGC AGACCTCCTC 
2801 TGGCCCAGCT ATAGGCCAAC TCACTCTCCC TCACTCCCTT GGGGAAACGG 
2851 CTGATTCAGT TACCTGGATT GAGGTCACTG GCAATGGCTG AAGTGGAGAC 
2901 GCAGGTGGAA CTGGTTCAGG CCGGGGGAAT CACCCACTTG AGTTTGTACT 
2951 AAAAGCCCCA GCCCAGCCCT GTTTCTCTTG GGAGGCTCCA TTTCTGCCCA 
3001 GTTACAGTCT GTCCTCACAG CTGTGCTCCT CAGACAGGTG GTCTCTGCCA
3051 GTCTTTGTGC CCAAGACTTT AGGGCACAAA GTCTGAGGAT GAGAAGATCT 
3101 GCTATTGTCC TAAAAGATTA GGATAATGAA AGCTGTAAAG GGATATAGCA 
3151 AACTAACAAT TCCTATGATA CTGGCATGAG AGCCTTGAAC AGTGCCTGGC 
3201 ATAGAGAAGG TGCACCAATA AATATTTGTT TCATGAATGA ATGAATGAAT 
3251 GAATGTCTAG AAAGCTAATC CCTCTCAGCC TCTGTTTCCA GTTCTTCTTT 
3301 CAAGCTTCAG ATTGCTTTGC CCAACATACA GCAGACTTGC AAGTAAGGTT 
3351 GGGCATGGAC TAGCCCTCAA ATGAGTTGTT TTTCTTTCCC TAGCCAGCTC 
3401 TCTATTCATA AGTCCGGCTT TCTCTGCCAC AAACAGACCT GATGGAGCCC
3451 CTGCAGGGCT GGTTCTCTCT TCAAGCAAGG CTTTAGAGTT GCATTAAGCA 
3501 ATTTATCCCC CGTCCACCTC CCCTTCCAGC ATCCCAGGGA TGGCAGAGGC 
3551 ACCCATGAGC CCCAGAAGGG ACAGGGGGTA AGATATTGAT GATGATGCTT 
3601 TTTCTTGGAG TGTTAGTTGG AAGAGAAAAT CTGCCCAGAC TTTCCAAGGT 
3651 ACAAAGCATT GTCTTTGTTG GTTTCAGTCT TGGGTGACAT CCAGGGGACC 
3701 GAGTGTCAGG GAAACTATTG TTGAGCAAGA GCAAAGAGCA GGAATTGGTG 
3751 CTGGGCAGGA AAGGAAGCCT CATCAGAGCA GGCCAGTGAG TCACCAAATG 
3801 GGCCCTAAGT ATTTGAGTTC CCTCAACTGG GAGAAGGAAA GCAAATGCCC 
3851 CTCACCCACT TCCAGTCATC AATCCACCGG CTGTCACCCT TGAGTTTGTA 
3901 AGCCCTTGTT CCTACCGCTC CTGAGTTTCT ATGAAAGGAC CTTGAGGTGT 
3951 TCAACAAACA GGGAAGGGAT CAACTCTCCC CACCCTGCGT TGACCAATGA 
4001 ATTCTTCCCT CCTCTGCTGC CCAGTGAATT AACAGGAGAA AGAACTCCGG 
4051 TATTGGAGTT ACCACACATA AAGGATAGTG AGTCAGCAGA GTGCACCCTG 
4101 CAGGAACAAT AGAGCCTTCC TTTTCAAGGA AGTTCTAAGA AAAATGGCAG 
4151 CAGGCAGGCC CCACTCGGGT GTATTCACTC ATTCATTTAT TCAACAAATA 
4201 TTTACTAAGT GCCCCTGTGC AAGGCTCGAG GTGTACAAAG ATGAACAGGA 
4251 GAGCTAGACT TCTTGCCATG CGTGGTGGGG TTTGCTGCCT AGTGGGAGAG 
4301 ACAGACAAAA AGCAAGGAAT GCACACACAG GATGCACACA CAGCGGCAGG 
4351 AACCAAGGTG CAGTTACCCA GGCCTGGGAT CAGACAGACA GGACTCAGAG 
4401 GAGACTTTCC CAGAGAAAAG CCATCTGAGC CAAGGGATGG ATCTGATACC 
4451 TCCGAAGGCT GAGCCACCAT AACACTCATA CCTTTAAGCC AAGTCTTATA 
4501 AACTCCCCAG GTAAGCAGCT GGCAGTCAGA AGACCTCCAG CTAATGCCCA 
4551 GGACAAGTTG ATGAGCTCTC AAGAAAAAGT TCCTGCCTTT TCTTCTCAAT 
4601 ATCCCTGGCA CACAGTTCAG TGAATTTTGA ATGAACCAAT GAATGAAATG 
4651 AGCAGGATAT GATAATCCCT CTCCAACACG GAATGTCCAA GCCATGCAGA 
4701 GCCGACTGGA AATTTTCCCC GTTCCCTTCC AGATGTTCCT CAGCCAAGAT 
4751 CAAAGCTGTG GTGTTTGGGC TGCTGCCTGT GCTCTCCTGG CTCCCCAAGT 
4801 ACAAGATTAA AGACTACATC ATTCCTGACC TGCTCGGTGG ACTCAGCGGG 
4851 GGATCCATCC AGGTCCCACA AGGTGAAGGG GCTCCTTCAG CCAGGCCTGG 
4901 ATTGCCACTC CCCTCACCAT TCCTCTCCTC ATCCCCACTC CATCCCTCTG 
4951 TGATCCCCAT AAGCTAGTCA TGCTGCTGAG CTTCAGTCTC GTTGTCCTCT 
5001 GCAGGCATGG CATTTGCTCT GCTGGCCAAC CTTCCTGCAG TCAATGGCCT 
5051 CTACTCCTCC TTCTTCCCCC TCCTGACCTA CTTCTTCCTG GGGGGTGTTC 
5101 ACCAGATGGT GCCAGGTAAG GCCTCTCCCC TCTGGGCAGG CAGGATGACC 
5151 CAGACCACAA GGATGGGAGG TGTGGCAAAG GGGCCTCGGG AGATTTTCCA 
5201 TCTGCATTCT CCTGGAGTTG TTCCTGGTCA GTCCTAGGGG AATGGTCACT 
5251 GTGAATGTCA TTTCCAGGTC CTCGGTGACC TTGGAGAAAC CACTGAGCCT 
5301 CTTTGAGTTC AGTTAGCATT ACCTGTTCCA TCTTCCTCCT AGGAATGAGA 
5351 GGAAGACTTA GCAGAACAAG ATATACCATA TGCTATAACA TGCTTAAACA 
5401 GATGTGAGAA ATCACCATCT AACTCCCTGG TTGGTCCCAG CCGGCCACTA 
5451 CAGGGACATT TGGACTTCTC TGGTGCTAAG TGAGATGGAG GAAAGCCTGG 
5501 TCACAAGGGC TGGTTTCTGG TTCAGGCTCT GCTTATATTT CTTATTTCTG 
5551 AGTTCATTTT CTCACGTGTC CTGTATGACA ATATTGACCA TTGGGGTAAA 
5601 AGCACCTTGA AAAGCATAGA TCATGGTTAG AGTGAGTGGT TGTTATTATT 
5651 GTGTTGGAGA AGAGCCTTGG AGGTGCAGGG ATCCATCCCC CTGGGGTCGG 
5701 GAAGCATTCC TGGGCCCCTT TCTGGTTTCC ATCGGTGTGG TTCAAACCTC 
5751 TGATTTTTGC TGGCTGGGTG GGGCACCACA GGTACCTTTG CCGTTATCAG
5801 CATCCTGGTG GGTAACATCT GTCTGCAGCT GGCCCCAGAG TCGAAATTCC 
5851 AGGTCTTCAA CAATGCCACC AATGAGAGCT ATGTGGACAC AGCAGCCATG 
5901 GAGGCTGAGA GGCTGCACGT GTCAGCTACG CTAGCCTGCC TCACTGCCAT 
5951 CATCCAGGTG AGGGGGCAGC CCCCAACCCT GCTAGAAGGG CATCAGAGCA 
6001 CCCTGCCCCT CCCTCAAAGC CTTAGCTTTG ATGCTAAATC TGATTTAGGG 
6051 GGCTGGGTGT GGAGGCTCAT GCCTGTAATC CCAGCACTTT GGGAGGCTGA 
6101 GGAGGGTGGA TCACTTGAGG TCAGGAGTTT GAGACCACCT TGACCAACGT 
6151 GATGAAACCC CATCTCTACC AAAAATACAA AAATAATCCA GGCTTGGTAG 
6201 TATGCGCCTG TAGTCCCACC TACTCAGGAG GCTGAGGCAG GAGAATCACT 
6251 TGAATCCGGG AGGCAGAGGT TGCAGTGAGC TGAGATCGCG CCACTGCACT 
6301 CCAGCCTGGG TGACAGAGCG AGACTCCGTC TCAAAAAAAA AAAAAAAAAA 
6351 AAAAAAAAAA CCCAAGTTAG GGCTCACCTC CTCCCTCCTC CCCATCCCAG 
6401 GGCTAAAGTG AACCTTGAAA ATTAACAGTA TCTCCTCATC TGCATGTAGC 
6451 AGGACCATAC AAAAAAACAA CAGCTGTACC TGGTTAAACT GTCCTGAGCT 
6501 TTAAACCTGT AAAAGACTCA CAGCCTCTCT CCATTATCCC GTGGAGAAAC 
6551 CCAACTCTCT GCCAGCATAG TCTTGCAGAC TGCTAATTTT CTCTAACATC 
6601 CCTCACTCCG CTCCAGCCTC CTCTGCTCCA AGCCACAGCA GCAGTTGCAC 
6651 AACATAAATT GAGCTTCTGC AAATGGTTGC AAAGGATTCT GCTAGGTTTT 
6701 ATGAAGGGAA GCACAACATG ACAGAATGCA AGAGCAAAAC ACAGTCCCAG 
6751 AGAGCGCCTT TTCATTCACT CATTCATTCG GTTTTGTGCC AAGAACTAGG 
6801 CTAAACCCTG GGATACAAAG ATAAGTAAGA AAGAGGTCCA ATTCACAAGT
6851 TGCTCACAGC CCAGCAGAGG AAGGAGCCAT GTCAACAGAT AAATTTGTAT
6901 GCAGTGAGAT AAGCAGCAAA GTAGAGCCAT GTACAAAGAC TGTAGGGACA
6951 CAGAGCAGAG TCACGGAGGA CCTCAAAGAG GAGGTGACAC TCCACCTCTC
7001 TTAAAGGATG AGAACTTAAC CAGGAACAAG GTATACAGAG GATGGTCCAG
7051 GCAGAAGGGA ACAGTGCCTA AAAACACTGA GGCCTGAGAG AGTGTGATCT
7101 GCGCAGGCAA AGTAAGGGGC TTGGTGTGGC TGGAGGGTAG AGGGCCCAGA
7151 AGAGGATGGA AAAGTAGGCA GGAGCCAGAC AATGAGATCT GGGGTCTGTT
7201 CTCTGACAGC GACTTTGGGT CTGATTGGCA GTTTATAAGG ATCGTTTGGG
7251 CTACACAATG ATGAGTGGGA GGTGGATTAG AATCAAGGCA GGGGACCTGT
7301 TGGGAGACTC TGCAGAGGCC CAGGCAGGAA TAATGCAGGC GAAGACCAGG
7351 TAGAGAAAGA GATGGGGCTG GACTTGAAAA GAATGTTTTA CCAGGAGCTT
7401 GGTGATAGAC TGGATGTGGG AGGTAAGGGA GGATGACTCT CAAGTTTTTG
7451 GTTGGGCAAC CAGGTTAATG ATGGTGTCAT TTACTGAGAG AGAAAACACT
7501 GGGGGAGGAC TAGACTTATT TTACAGATAA GCCAAAGCCA GAGAGGTGAT
7551 GTGACAGAAA GGCCCATGCT CTAAAGGAGC TGAAGGTCTG ATGGCAGCCA
7601 TGTAGAGCAC AGTGAAGGGC AGGTGAAGGT CACAGATGGT CCAATTCCCT
7651 CAAGCTACTG CTACGCTAGG ACTGCACGGA GCTCCAGACC TGCGTGTGTG
7701 TGGGGCGGGT CGTTGGAACT GCTGAACCAC ATTGGTCTTC CGCCACCAAC
7751 CACCCTTTTC CTCCTCTCAG ATGGGTCTGG GCTTCATGCA GTTTGGCTTT
7801 GTGGCCATCT ACCTCTCCGA GTCCTTCATC CGGGGCTTCA TGACGGCCGC
7851 CGGCCTGCAG ATCCTGATTT CGGTGCTCAA GTACATCTTC GGACTGACCA
7901 TCCCCTCCTA CACAGGCCCA GGGTCCATCG TCTTTGTGAG TCTGGGGATG
7951 CACCCCTGCC ATTGGAGCAA GGCTCCAGCA GACACATGAG GAGGATGTAC
8001 TGTTTTAAGA TGTCGTGAGC TCCTCATTGC AAGGGCTGGC TTAGCTGTTG
8051 TTCAGAGAGG ATTCTGAGGG GGTTTCTGTC TTGGGAGGGT CAAAGTCATG
8101 ACTCACAGAG GTTCTTGGTA GTTAATACCT GCAGAAAAGA GCTGTACATT
8151 CTCCGCCAGT TCCCCATTCT AGTGCCTCAA CCCCTCCCTG CCTGGAAAGT
8201 CCTGCCTTAT GTCTAATCTC CATCCCTCCT CCTTCAGCCC AAACTCTTCT
8251 AAAGAAAAAG AAAGCATTCC TTTTCTAGCA CAAGTTCCCC ATGTGCCTTT
8301 TGGGAAAGGG CGGTGGGCGA CGGGACAGGG TTCCTGATCA GGGTTTTAAT
8351 TCTGTCTTGG TGTGCCTCCA TTAGCTTTGA TGGCATCCCT TCCCTGGGTC
8401 AGACACCCAA AGGTGGGGTA TTATGGGAAG AAGGGGTGGG AGGCTGTGAG
8451 CATGATGCTC TTTCCCCCAG ACCTTCATTG ACATTTGCAA AAACCTCCCC
8501 CACACCAACA TCGCCTCGCT CATCTTCGCT CTCATCAGCG GTGCCTTCCT
8551 GGTGCTGGTG AAGGAGCTCA ATGCTCGCTA CATGCACAAG ATTCGCTTCC
8601 CCATCCCTAC AGAGATGATT GTGGTAAGGA CCTTGTTCAG AGCTGGGATG
8651 TTGGGGGGCC AGGCTGTGAG ACGAGGAAGC CCCTACCTTT CCTCACCCCA
8701 TCCCCTCAAC TGGCAGCCAG TGGGACAGGA AGTCAGTTGT GAATCCATCC
8751 CATCCCCCGT ATGTGGCGTT TCCTCTCTTT CTACTGCTCT AATAATTCCC
8801 CCTAAGGAGG CAGGGGAGTG GGATTCAGGG TCCCCAGAGA AAAGGGAGAC
8851 TTGAGAGAGA CGCCTGCCCT GGCCCCACCT TAGGGCCAAT CCCCATTCTC
8901 CACTCTGGGG TTTGCAGGTG GTGGTGGCAA CAGCTATCTC CGGGGGCTGT
8951 AAGATGCCCA AAAAGTATCA CATGCAGATC GTGGGAGAAA TCCAACGCGG
9001 GTGAGTCCAG GTGGCCCAGA AGCCTGGCCC ACCCGCACCT CATGCCCCAC
9051 TAAGGCCTGA GCTCGGAGAG GGAGACAAGA TGAACTCTAT GAAAGTGCAG
9101 TCGAAACTGT ATGACACTGA CCATGTATGA ATTATTACTA TTACCGTTTC
9151 CTGAGAAGGG CCGCACAACC AGCCAATGTA GGCTATTTTA TGAGAAATGA
9201 GTCTTAACTG CCACACTCCC CTTATAAATC TCATTCAACT GATGCTGTTA
9251 AACAAAGCCT CTCTGAACAG CCGCTTGCTG GCTCTTTGCC TTGCTCTAAT
9301 GCATTGGTTC TTTGTCCATG TAGAAAGGGA ACTATTAGGT TCAACCAGAT
9351 TCATGAAGCA TCCACTCTGT GCCAGGCACC ATGCTGGGCC CTGGGAGGAG
9401 AGGGGTGACG CTTGTCCTGC AGGGTTGGAA CAGGCAAGGG AGGGAAGACC
9451 ACATAGCACC AAAGGTCTAG GGGTCTGTGG ACTCGTGAGC ATACAGGGTT
9501 CAGAATCTGG GAGTTAACAA ACGAGGCCCT ACCACATACT GGCCCGGGGA
9551 COTGGGCAA GTTAGGTTCT CTCAGCCTCA GTTTCCTCCT TTGTAAAACA
9601 GGAGTGATGG TCCCTACCCT ATGGGGTGGT GCTGAGGATT CAGACTGGAT
9651 GGGATAACTT AGGCAAAGAT CCCGGCACAC CATGGGGGCC TGGCTGGTCC
9701 CTGTGGGCTG GTGAAGGACT TGGCTGCCCT CCCCACTCAC ACCCTTGGGT
9751 TCTGCCTCCT TCCTGGCTCC TCGGCAGGTT CCCCACCCCG GTGTCGCCTG
9801 TGGTCTCACA GTGGAAGGAC ATGATAGGCA CAGCCTTCTC CCTAGCCATC
9851 GTGAGCTACG TCATCAACCT GGCTATGGGC CGGACCCTGG CCAACAAGCA
9901 CGGCTACGAC GTGGATTCGA ACCAGGTAGC TCTGGCCACC CCCGGCAGGA
9951 CTGGGCAGGA CAGGTCAACT CAGGCCTGGC ATGACATATC TTGGGTGGGG
10001 AGATCATTGG GCTGAGGTGA GGCAGGCTGC CTCGAGTGTG GGGGATAGGG
10051 GGTCCTCTGA CCCTAAGAGG CTGACCTCCT CTTGACTGGG AATGTGTGAC
10101 TTTATAGCCA CTGGGTCACT CTCAGGTCTT AGGCCCACAG TCCAGCTTGC
10151 ATGCCTGACT GCACTTGGTC CCCGTGCCCC CCAGCCCCAC ACTGGCTTCT
10201 AATCCTGTCC CCTCCCTGCA GGAGATGATC GCTCTCGGCT GCAGCAACTT
10251 CTTTGGCTCC TTCTTTAAAA TTCATGTCAT TTGCTGTGCG CTTTCTGTCA
10301 CTCTGGCTGT GGATGGAGCT GGAGGAAAAT CCCAGGTGAG CCTTGTTCTA
10351 GGGGAGTTGG GGGGAGGTGG TAAGAGAACA GTTGCCCCAA AAAAGCCTGG
10401 GCACTGCAAG CCAGGCCAGC TCTTCTCCGA CCCCTTCTTC CCGTACTTAG
10451 TCTCCACTCC ACCAAAGCCA TGGATTGGAA ATAAATCAAG AGCAAAAATT
10501 TCACACCTTC CCTCTATCCC CAACTCTTTC TCGGAATAGG TGGCCAGCCT
10551 GTGTGTGTCT CTGGTGGTGA TGATCACCAT GCTGGTCCTG GGGATCTATC
10601 TGTATCCTCT CCCTAAGGTA AGAGCCCAGC CATCGAGCAG AAGTCAACGA
10651 AAGACTCCAA TAAGAACAAT CCCTGAGAGT TGTGTGGCAC TTTACGGACC
10701 ACAAAGTGCC ACTGTTGTCA TACTTAGTCT CAACCACAAA CTGTGAGGTA
10751 GACAATGCAG GTTTTATCCT CCCCATTTTA CAGGTGAAGG AAACTGAGTC
10801 TGAGAGTCTA AGTAACCTTG TCCATAGTGA GGCAGCTTAC AGCGCAGGGC
10851 TGGTCCCAAA CTCCAGCCTT CTGGCCTCAG AGTCTAATCC CTAGGCAACA
10901 TTTGCACCTA CCCACGAGTA CCAGGCTCTT ATATAGCCCA GCTAGGAGGG
10951 CTCTAGGCAT GCGTCATTTA GAGATGAGGG AAGAGAGATA GGGAAAGGAT
11001 GGGGCCAGGA AGGACCCCAT GGCTCTAACG CCAGCACTTT CCAAACCTAA
11051 GGTCGAATGC AGAGATTTGG GGGATCAGCC AGGGGAGGTG TTCCAGAACT
11101 CCGTCTCTGT CCTGCCAGGC CTTGGGGTCG GGTATGCGCA GGAGGGCAAA
11151 AAGAAGGGGA GACCCTGGGG TCCTGGAGCA ATGTTCTGCT TCTCTAGTCT
11201 GTGCTAGGAG CCCTGATCGC TGTCAATCTC AAGAACTCCC TCAAGCAACT
11251 CACCGACCCC TACTACCTGT GGAGGAAGAG CAAGCTGGAC TGTGTAAGTA
11301 TCGGGCAGCC TCTGGGTACT GGCCATGCCC CTGCCCTCTC CTCCAACCCC
11351 ACAGCCCTGT CAGCCCTGTC CTAACAATGA ACCCTCTAGT CTGCTGCTTC
11401 CTAATTAGCA TGAGATGAGT GGTTAAAAGT CCGAGTTTCG AAGTGAAACA
11451 TCCTATGTTC AAACCCTAAC TCAGCCATCT GCTGGCTCCA TGGCCAATAG
11501 CAAGCCCCTT AACCTTTCCC AGTCTTGGTG TCTTAACTGG GCAAATGGTT
11551 ATTTTATGCT CTCTGCCTCC CAGGGTTTTC TATGAAGAAG AAGCAAGGTA
11601 ATACAAGTAA ACATGTTGTC TACATCGTAT TTTATACTCA ATAAAGCTTA
11651 GCTATGACTA CTTTATGACA TACAGCTTTA AAAAACAAAA GGAAATAGTT
11701 TGTATTTTAA AAAAAAACCT AGAACATAAA GCCAGAGGAC CAAAATCTTG
11751 AGCAAGTTAC TAGACTTCCC TGGGGTTCTA TTTCCTCATC TGTAAATGGG
11801 GGTGAGACTC ATGCAGTCAT GGTTGCGTCA AACGCTGGTT CCGAGGATTA
11851 AATGAGATCC CAGTGGGAAA ACACCGCATG AGCGCAAACA TTCTGCAAAC
11901 ATGACTTATT GTCCTGATTA GTCACACACT CCACCGCATC ATCCGCTGGG
11951 CATAGTAATG AAGGCCAGTG TGTTTTGACG ACACTGCCTT CTCTCCATTT
12001 AAGCCCCACC ATAACCTATG GGAGAGGATT TACTAAACTT TCTTAACGGT
12051 GATGAAACCA AGGCTCAGAA TGGTTAAGTA AATTGTCAAA GGCCACAGAG
12101 GTAGGGAGTG GTAGAGTCTG GATTAAAACT CCAAGTCCTG GACTCCAGAC
12151 CTCTAGGCTG TACTGTCTCA TAGGGAAGGC AGTCTCACCC ACCTAGGGCA
12201 GAGAAGAAAA TCCTTAAAGC CAGAGAAGTG AGTGGCTCAT CTGTGGTCAC
12251 CCAGAGAGAC AGTGATGAGG ACAGGGAGAA AAATTATACC TCAGTTCCCA
12301 GCCCAAGGAT CTGCTTTGAC CATAACCCAA CAAGCCCCCG CTATGGTGGT
12351 ATTTCCTTAG GTTCATATGG CGGCTTTTGT TTCCATTTGA TCTTCACAGC
12401 AATTCTCTAC AGGAATCTGG GCAGATTTAT TTCCTTTAGA GGAATTTCCA
12451 GGTCTTAAAA TCTATAGGGG GCAACTATCA AAACTTCACC CAATGTTGCC
12501 CCCTACCCAC ACACAAAACC AGGCCCCCAG CCGATCAGAA AGCACTGCTG
12551 AGCTCCTGTC AGGGCCCACG CAGCTCGCTG TGAGACAGAG AGAGGGAACT
12601 CACATTTATT GATCACCTAC TGAGCATCCA TCACTAGGCT AGGACCGTCA
12651 CATTCCTTAA CTTTTGAATC CTTTCATGAG GTAGGCATTA TTATTCTCCT
12701 TTTGTTTCAC ATAGCCATTA AAGAACAAAA TTTGGGGCTG GGTGTGCTGA
12751 CTCACACCTG TGATCTAGCA CTTTAGGGGG CTGAGGCAGG AGGATCGCTT
12801 GAAGTCAGGA TTTCAAGGTC AGCTTGGGCA GCTTAGCGAG AGCCGTCTCT
12851 AGAAAAATAT AAAAGTTAGC TGGGTGTGGT GGCACGTGCC TATAGTCCTA
12901 ACTATTCAGG AAGGTTAGGC GGGAGCACAA CTTGGGTTCC AGGGTTTGAG
12951 GCTCCAGTGA GCTGATCTTG CCACTGCACT ACAGCCTGAG CAACAGAGCA
13001 AGACCCTGTG ACTCCAAAAA CAAACAAACA AACACATTTT GAACCCAAAC
13051 AGATCTGACC CAAGATGCAT GCTCTTATAG ATGCCACCTC CCTGTGTGCT
13101 GGGGCTTCTA CTAAAAACAC AGACAAGATC AGGCAACCAC AGTCAATCTA
13151 AGGGAAAGAG GAAAGTGTAA CCAAAGCACA AATACATAAA ATATTGCAAA
13201 AATGCTATTT AAAGAAAAAA AAGAGAAGAG AGGCTCTGAG GTTGTACTAA
13251 CAGAGAATGG CCTTGGCTAA TCCAGGAAGA CTTCCTGAAA GAGGTTGTTT
13301 TTTCCCCAGG TCTGCTTTTG ACATCTCTCT TTTCACAGTG CATCTGGGTA
13351 GTGAGCTTCC TCTCCTCCTT CTTCCTCAGC CTGCCCTATG GTGTGGCAGT
13401 GGGTGTCGCC TTCTCCGTCC TGGTCGTGGT CTTCCAGACT CAGTTGTAAG
13451 TGATAGCTTC CGCCCTCCTA GGCCCACAGT CGGTTCCCTG GGCCAGCCCG
13501 CAAAGGGCTT CCATGCCACG GCCTGGCTTA GTCCACTGTA CCTTCCACCT
13551 CTGGGCCTGG CACTGGAGGT GCTGCCAGGC CCAAAGAGAG CCCAACCCAG



FIGURE 3D 



Docket No.: CL000861CON 
Serial No.: TO BE ASSIGNED 
Inventors: Kari GUEGLER et al. 
Title: ISOLATED HUMAN TRANSPORTER.. 



13601 CCAGGACTGT GGGCACAGTC TGGGCTGTTT GACTTCCCAT ATCTTGAAAA
13651 CCCCAGAGAA AGCCAGCATA CTCTTGCTGG GGATGGCTGG GGAGAGGGCA
13701 GTGGCAGAGA AAGGAGGGCA AGGGCAGGTG GTGAGATTCA ACATCCTTCC
13751 AAAGACATTG CCAGAACCCC AAACCAAATG GGACCCCACC CCAGGAGAGC
13801 GCCAGGGTGG AAGACAGAAG CTGTGTTCTA CACACTGGGA GTATTACAGA
13851 GAAGGGGTCT TGGCCAAGGC AGGGAGTACG CTGAATGTTG GGGGAATCCT
13901 ATCTTCTCTT CTTGAGAACT CAGAACAAGG AAATGATGAC TTCAGGGCGA
13951 CTCCCACCAC TTCTCCCACC ACTTCTCTCC CCTGCCCTGT GGTCTGGGAG
14001 CTATGTCAAG GACCTGCCTG TCATCCTCAT AGTTATAGGA GGCCACAGGC
14051 CACCAGACAT GTGTCTCCAG TGCAAAAAGA CAGACACAGC AAGTCTGGGG
14101 GTGAGGACAG GACCCCATCC TACCTTGGCT CTGCCCCCGC CCCAGCAGGG
14151 GCACCCTTCC AGGCCCATGT GCCATTAGCA TTCTCTTATG TTTTTCTCTT
14201 CCTGCTTCAT CCAGTCGAAA TGGCTATGCA CTGGCCCAGG TCATGGACAC
14251 TGACATTTAT GTGAATCCCA AGACCTATAA TAGGGTAGGT AATTCAAGCT
14301 TATGACCTCC TTCTTTTGCT CTGCACCACC CCAAGAAGAG GTTGCI 1 I I
14351 AAAGCCAATA AAGACATTTC TGCAACTTGA GCTCAGTCTC CCTGTCACAG
14401 GCCCAGGATA TCCAGGGGAT TAAAATCATC ACGTACTGCT CCCCTCTCTA
14451 CTTTGCCAAC TCAGAGATCT TCAGGCAAAA GGTCATCGCC AAGGTAAGGC
14501 TCAGTCCCTG GCGACCAGAG GCTCTGGACA GAGAGTGGCC GGAAAATGGA
14551 AGCAGAAGGG CGGTGGGAGC TGAGAATAGG CCACTCCCAT AGAGGGTGGA
14601 GGTCAAGATT GCTGTTGGCT CTCTCCCTGC AGACAGGCAT GGACCCCCAG
14651 AAAGTATTAC TAGCCAAGCA AAAATACCTC AAGAAGCAGG AGAAGCGGAG
14701 AATGAGGCCC ACACAACAGA GGAGGTCTCT ATTCATGAAA ACCAAGGTGA
14751 ATGAAGGCCA GAAGCAGCCC CGTGCCCTGC TCTCCTGCCC ATTCTGATAC
14801 TGCCCCCTGT TACTCATGGT ACCCTGGGGG CCCCGCTTCC CACCCTGACA
14851 GGCAAAGACA GAAAGTCTCT GGGAACACTG CCTGGTGGCC GCTGGGCATT
14901 I I IL I I LI I \ I I lil C II II TCTTTTTAGA GATGGAATTT TGCTCTTGTC
14951 ACCCAGGCTT GAGTGCAATG GCGTTATCTT GGCTCACTGC AACCTCCACC
15001 TCTGGGGTTC AAGCGATTCT CCTGCCTTAG CCTCCCAAGT CGCTGAGATT
15051 ACAGGTGCCA CCACACCCAG CTAATTTTTG TATTTTTAGT AGATATTGGG
15101 TTTCACCATG TTGGCCAGGC TGGTGTCAAA CTCCTGACCT CAGGTGATCC
15151 ACCTACCTTA GCCTTCCAAA GTGCTGGGAT TACAAGCCTG AGCCACTGCG
15201 CCCAGCCTGG GCATTTTTCT TCTTGGATGA GGTGCTACCA TCTCCCAGGG
15251 AAGCCACTGA ACCCCCAAGG CCCTTCTCCA TTTTCTGGCT AAGATAGGAC
15301 ATGGCCCATG GACTTTTGAA CAACCCAGAG GGGGAACAGC AGTGAATTTC
15351 CTGGGGAACC CAGGCAGCCC AGGGCTAGCA AGGCTGGGGT GGCCATGGCA
15401 GTAATCCTTG TAATCCCAGC ACTTTAGGAG GCCGAGATGG GAGAATCACT
15451 CTCATGAGTT CAGGAGTTCG AGACCAGCCT GCCCAACGTG GCGAAACGCT
15501 GTCTCTACTA AAAATACACA AAAATTAGCC AGGCGTGGTG GTGGGCACCT
15551 GTAATCCCAG CTACTCAGGA GGCTGAGGCA CGAGAATCAC TTGAACCCGG
15601 GAGGCAGAGG TTGCAGTGAG CCGAGATAGT GCCACTGCAC TCCAGCCTAG
15651 GCAACAGAGG GAGACTCTGT CTCAAGAAAT AAAGGAGCTC AGTGTCCCCG
15701 GAGGGGCTTT CTCCCAGAGA GAGTGGGCTT GAGGCTTCAG TGCCTCTCTT
15751 GGCTGGGTCC TCTGACTTTG TCTGGGTTGT AGGAGACCAA GTTTGCAGGC
15801 CCTGCCTAAG AAAGGGCTTT GGGAGAGGCC TCTCTGGTGG AGCTTTCAGG
15851 GTCTGTGTTC ACCATCACCG AGGCGAGTTA TTCCCCTACA CCTACACCCT
15901 CCATGCCCCT GCTTCAGTCA CAGCAAGGTC TGGCTCAGTC TGGTGGTCCC
15951 TGACTCTGCC CACTGTCCCC ACCCTTCCAG ACTGTCTCCC TGCAGGAGCT
16001 GCAGCAGGAC TTTGAGAATG CGCCCCCCAC CGACCCCAAC AACAACCAGA
16051 CCCCGGCTAA CGGCACCAGC GTGTCCTATA TCACCTTCAG CCCTGACAGC
16101 TCCTCACCTG CCCAGAGTGA GCCACCAGCC TCCGCTGAGG CCCCCGGCGA
16151 GCCCAGTGAC ATGCTGGCCA GCGTCCCACC CTTCGTCACC TTCCACACCC
16201 TCATCCTGGA CATGAGTGGA GTCAGCTTCG TGGACTTGAT GGGCATCAAG
16251 GCCCTGGCCA AGGTGAGGCC CTCGGGGACA GCAAGCACCA CCCACTCCAC
16301 CCCCTCCGCT CTGCTCTCCA CATTCCCmr CCTGGGAGCC CTCATTTCAG
16351 GAAGCTGAGG GAGGAAGCTC ACTGGGGAGA CTAACAGCTC CTAGGAATCC
16401 CTCCTTTCCC CAGACGCCAC CAGGTTGAGA CATTCTCCAC AGAGCAGGCC
16451 CAGACGGCCC ATGACAATGA GTGGCGGGAC AAGTCTACCA GAGTTTCAGG
16501 CCCCTGTGCT CCCAACACCC CCAGCAGTGG CCATCCCAAG TCCCTCTCAG
16551 CCATCAGGAA CCCACCCAGG TTCTCTGAGG AGGGTCCAGT TTGGCTCCTG
16601 GTTCATGATC TGCTGCCCTT GTCCCTCATT CACCAGCCAC CCTAGGACAG
16651 GAGAAGAAAT AATACCAGTG CCCCACACCA TCAGGCCAAA CAGAGAGCCC
16701 ACGGGACACC TTGAATGAAT GTATCCATCT GATAACTTTC CAGCAGCCAC
16751 CGCCAATGGC GGGAGTCAGC AAACCTCAGA GCTGGCTCAG ATAGAGGCAA
16801 GCCAGGGGAA CAATGGGCAC AGAGAGTGTT CGGACTGCCT TCACCATCAA
16851 CCAGGCGCAG GGCAGGCCCC ATACCCAGCC TTGGGCCTCA GCCGGCTTCC
16901 TTAGCCAGGA TCTGGAGTCC AGGCCAGCCT TGGCTGAAGC TCTAGACTCC
16951 CTGAGCCTCC ATCCTCCCCT GCAGCTTCTG TCTGAAGCCA CAAAGAAGTC
17001 TGAGAATCTA AGCTACTGAA AGAAAAGATC AGCCGGGCGT GGTGGCTCAC
17051 TCCTGTAATC CCAGCACTTT GGGAGGCCAA GGCAGGTGGA TCACAAGGTC
17101 AGGAGTTCAA GACCAGCCTG GCCAACATGG TGAAACCCCG CCTCTACTAA
17151 AAATACAAAA ATTAGCCAGG TGTGGTGACG GGCCCCTGTA GTCCCAGCTA 
17201 CTCGGTAGGC TGAGGCAGAG AATTGCTTGA ACCCAGGAGG CGGAGGTTGC 
17251 AGTGAGCCAA GATCGCGCCA CTGCACTCCA GCCTGGGCAA CAGAGTGAAA 
17301 CTCCATCTCA AAAGAAAAAA AAAGAAAATA TCTAGCCCCA CAAGAAGGGG 
17351 CCATGGTGAC TTTAAGTGCC CGCCACGTTG GCAAAAGTCC ATTTCCGCTC 
17401 CACTTCCCAG AGAAACCGTC AGCCAACACT CCAGGGAGAA GTGGTGTGCT 
17451 TTGCTGCTAT TTTTGTCTTT GGCTGCTGGG CTCTCAGGGT TGCTTATTTG 
17501 TTTGGCTTCC CCTCTGAAGT ACGTTTTGTG AATCACTTTT GAGACCCACT
17551 CAGAACATTC CTTTCCTTTT GCCTCCCTAC CCCAACAACA CTTCTAGCTG
17601 AGCTCCACCT ATGGGAAGAT CGGCGTGAAG GTCTTCTTGG TGAACATCCA 
17651 TGGTAAGAGA AAGAGGACAT TTAGGGACTG AAAGACTGGC AAGGAGTGTG
17701 GGGTAGGAAC AGGTTGGTGG GGTCTGAATA GTGAGGAGGT TGGAAACGAG 
17751 AGCACCCAGC TATCCCCCAC AAGCTGCTGC CTGCTCATAA AAGCTTCAGG 
17801 TACAAGTCCA AAGAGACTGG TCAGATTGCA TAAACATCCT AGGGGCCTTA 
17851 GTGACAGAGT GGGGGTGAGG AGGTCATGGA GTTACAGAAG GACAGCTAGG 
17901 ATTCTAATCT ACCCCATAAC TAATTTGCCA CGTATCCTTG GCCGAGTCAC 
17951 TTTATCTCTC AAGGGATCTA TTTCTACCTA TGTAAAACGA GAGGGTTGAC 
18001 TAGATGGATT TGGGGATCCT CTCCCAATCA GAAACTCTGT GAATCGATAT 
18051 AGGCATAGAG CACACGGTAC CCTAATTCCC CAGGGAACAT ATAAATATGC 
18101 AGTTTTGTAG GCATACAGCC TCCAAAGGGT GCATATACAC AGCCTCAAGG 
18151 ACGTGGCCAC AGGGCAGCAG ACATTTACAT GACTAGCATG TACGCAAAGT 
18201 GCAGAGATGT GGGAGCAAGT GCACACAGAC ACACAGGAGA ATGTGAAGGG 
18251 GCACATACAC ACACACCCAG CTCCCTGCAC TGGGTCAGAC CCCCTCCAGC 
18301 AGGGCTGCAG TTCCCAAGCT CCGCATGGCC ACGTTCGGGG AGAGAATCTG 
18351 CAGTGGCAAT GACCTGCTAT GATATGTTCT GGAGTTAGAA GCAGTGGATT 
18401 CTCCCCAACC TCACTGGACA CCCCCTTAGG AAACCATCTC TAGGATTAAG 
18451 AGTAATCCAC ACAAACTTCC AATGCCACAC ATTGGAAGTT GCTGGAAAGG 
18501 TCTGGGAAAA CAAGAGGAAG GATGGGTCCT TGGGGGATAG AACTGGCAGC 
18551 GGCCTCTTCA AGGATGGCTT AGGCTTTTCC ACTCGAATCA CCACAAAGTA 
18601 CTGACTCCCT AAATCAAACT GCTTCCTTCT GCTCTGGGTT GAAACTTCAG 
18651 CATCCTCAAG TTCATGTTGC CCTCTGCCGT CCAGAACTGA TATTGCACTG 
18701 CCAATGCCAT GGCCCTCAGA TACAGCAAGA GCTGGGACCT CAGGCCTCTC 
18751 CCATCCCTGC TCTGGTCTCA CTATCTTCCC CACCCCCAGC TCCAATCCAC 
18801 AATGGCTGTT ATCTTTCTGA AGGTGATCTT TTCTCCTTCT AGCCCAGGTG 
18851 TACAATGACA TTAGCCATGG AGGCGTCTTT GAGGATGGGA GTCTAGAATG 
18901 CAAGCACGTC TTTCCCAGCA TACATGACGC AGTCCTCTTT GCCCAGGCAA 
18951 ATGCTAGAGA CGTGACCCCA GGACACAACT TCCAAGGGGT AAGGTTCTTG 
19001 CACCTGGGGA ATCCTAGGCT CCAAGGCACT GAAATAGCAG GACCAAGAGG 
19051 CATTATTAGA AAGAACACAG GAGAAGGTTT AAGTTCCAAT ATCAAGTCTG 
19101 CCATTTCAGT TTTCTGAATC TGTTTCCTTA TCTATAGAAT GAGCACCATC 
19151 AACTAACATT ACCTACCTCT CTGCATTTTT CTTTTATTTT GTTTTAGGGT
19201 TAAATGATAA TTACATCTTT TGTGTCACTT GAAAGCACTT TGTGTATTGT 
19251 AAAAATTCTT TATCAATATA AGTTTTCTGG TTGCACAAAC ACCCAAAGCA 
19301 TAGTAGAGCA GGCCCACTCT GCTGGCATCG TTCCCTGCCT CCTCCTCATC
19351 TCTTTCTAAA GGGGGCTTTC GGGAAGGGAG GGGAGGGGAG TAAGCCTACC
19401 CATTTTAACT TACCGGAGCT TAGAGATTTC AGGCTGGTGA GGGATAAAGA
19451 GATTGGGTCT GAGTTTTGTC TCAGCTTTTT GACATTTAAT TTACTAGCTC
19501 AGTAAGTCAT ACAAATGGGA TACAAATAAC ACCATCTAAA ACTCCAGAAG 
19551 ACTGGGGAGT CAGAAAAATC CTACCTCCTT GGGGTCCCTG CCCAGATCCC 
19601 CAGTCATCTC TAGCCCTCAG GGTCCCCTCC CAGCTCAGCT CCTGCCCTTG 
19651 GCCTCCCAAG ACTCTTGTTG TGCCCCAGCC CTGGGTAAAA ACCTCCCCTG 
19701 CCCTCTGTGG GTCATAAGAA AGGCTTTTCT GGCCCTAGAG CAATGATTTG
19751 CTCTTTGCCT TAAGAGACTG ATGAAGGTGA AACCATCTGT TCTAAGTGCT 
19801 GAAAGACTGC CCAGGAACAC ACAGGGCGCT GGCTCCTGCC CTCCATGCCT 
19851 AGAGGGAAAC CCTGGGGAAA CAACGGGCTT TCCTGCTTCG TGAAATTTGT 
19901 CCGCAGAGCA AAGAGGGAGA TTCTGGAGGA AGCTGCATTA GTTGTTAGTG 
19951 CCCTAATCAT GTTCAGCTAC TCTAGTTGGT ATGTATACTT GATTAGTCAT 
20001 AGCACTTATA AATAATTTAT ATTTTATATA ATATATACTT ACATATTATA 
20051 GACCATTCAC AGATACAAAT CACACACATA AACACACACC TTTTCAACAG 
20101 CATTGTGAGG GACAAAGCAG GCAAAGTGAG GCTGGTTATC AGACTTTAAC 
20151 AGATTAGAAA ATATATTCCC AGGAGGACAG GAATTCCCCA AGGTCAGGCA
20201 GCTAGCCAAT AGTTTTTCTA AGCTGAGTAA AACCTTCCCT GCCTCTAACG 
20251 GCCCACAAAG GAGGGAAGAC CGCGATACAC ACCTGTCTGG TATAAGGGGG 
20301 AAGACCACAG CCGTGCTGTT TTTGTGAGGC AGGTAAGGGA AGGGGCAAGA 
20351 GGATAAGTCA TGTGTCAGGA AGCAGCGTCC AACCAGAGCC GGCCACCTGT 
20401 CCCTTTTCCT GCCACCATGC ACCAACTTTG CTGTTCAGTC ACTGAAGCTC
20451 ATTCTGCACT GGCTTCCTCC CTTCCAGGCT CCAGGGGATG CTGAGCTCTC 
20501 CTTGTACGAC TCAGAGGAGG ACATTCGCAG CTACTGGGAC TTAGAGCAGG 
20551 TGAGCTGAGG GAAGGGGCTG TGAGGGTGGG AGCAGGGCGA AGAGGGGAAG 
20601 GATGGGGTCG CTGTCAAATA CAAGGCGTTC ACTCAGCTGT CTCACCTCCA 
20651 GCCCAGAGCA GTCACATTCA AGGCCACAAA GATTTGTGGT CATCTTTGTT
20701 TTTTTTCTTT TCCTTTTCTT TTTTTTTTTT TTTTAATTTG AGACAAAGTC
20751 TCACTCTATC ACCCAGACTG GAATGCAGTG GCATGATCTC AGCTCACTGC
20801 AACCTCTGCC TCCCGGGTTC CAGAGGTTCT CCTGCCTCAG CCTCCCGAGT
20851 AGCTGGGACT TCAGGCCTGC GCCCAGCTAA TTTTTGTATT TTTAGTAGAG
20901 ACAGCTTTTC ACCATGTTGG CTGGGCTGGT CTCGAACTTC CGATCTCAAG
20951 CAATCTGCCT GCCTCGGTCT CCTAAGTGCC TGGATTACAG GCATAAGCCA 
21001 CGATGCCTGG CCTTTGTTTT CATTCTTCTC ACTCCCTGAA AGGCATCGTG 
21051 GGGAGAGGGT GAGTCACTGG ACCAAGTCCT AGAGAACCAG TATCTATTCT
21101 TATTCTCCAA CACATCACCC ACGTGACCCT GAGCAAGCCA CATACACCCT 
21151 GGGCCCTAGT TTTTATCATC TGTGAAATTA GGGGAAACAT AGGTAATACC 
21201 TGTCCCATCC ACCACACAAG ATTGGCAGGG CAGTCACTTG TTCTTTCATT 
21251 AATTCAGCAG GTATTTATGG CGTACCTACT GTTTGCCTGA CACAGTTCAG 
21301 GATGGGCACA TAGCAGTGAG CAAAACAAAG GCCTCTGCCT TTTAGAAACT 
21351 TACGTTATGG TAGAATAGAT GGATTTNNNN NNNNNNNNNN NNNNNNNNNN 
21401 NNNNNNNNNN NNNNNNNNNN NNNNNNGTCT ACAAATGAAT TATTATTGCA 
21451 TGTGGACAAG CCTTAAGAAC TAAAAAATAT GTGGCTGGGT GCAATGGTTC 
21501 ACACCTGTAA TCCCAGCACT TTGGGAGGCT GAGGTGGGCG GACCACCTGA 
21551 GGTCAGGAGT TTGAGACCAG CCTGGCCAAC ATGGCGAAAC CCCGTCTCTA 
21601 CTAAAAGCAC AAAAATTAGC CAGGCGTAGT GGTGCATGCC TGTAGTCCCA 
21651 GCTACTCGGA AGTCTGAGGC ATGAGAATCA CTTGAACCTG GGAGGCAGAT 
21701 GTTGCAGTGA GCCGAGATCG TGCCACTGCA CTCCAGCTTG GGTGACAGAG 
21751 CTAGACTGTC TCAAAAACAA ACAAACAAAA CAAAACCTAA AAGATATGTG 
21801 GATATGAGGG ATCACCATCC CCATAGGGCC CCTGGATTAA CACCACCCCA 
21851 CCAATGCCCT GAATTAAAAG AAACCAGATG ACTAGGTTTG GAGAAATCTG 
21901 GCTTTGGGTC TATGAGAAGT AGTGTCTCTC TTTGTGCCTC TTCCCATTCT
21951 TTTTGACATT GAGCTCCATG GTGCTCTGAA TCCGTCTCTC ACAGTGCTGA 
22001 TGGCAGGTGG GACAGATTAG AAAATAGAGC TGGAGCCACA GAGATTTGGC 
22051 AGACTGATTT CGGTGCCCTC TTGGAATCTC CAGCACATTC CAAAAAGCCT 
22101 GGATAGGACC AAAATAGCTT ATCAACGTGA GAAAGGACTT CAGAGCTTGT
22151 CTACTGCCAA CCCTCATTTT ACCCAATGAG GAAAGTGAAG CTATTAGGGG 
22201 GCGAGGGACA CGTGGAAGGT CACACAGCAC ACAGGAGGTG ATTCACATGT 
22251 AGATTTCAGC ACCTGCTCCT GCCACGCTGG ACTGGTTCAC CTCCTAGGCT 
22301 GACCCTGCCT CTCCCCTGTT CACACACACT CTCGCACACA CACACACACA 
22351 CACACACACA CACAGGTGCT TTGTTCTGGC CAGGGGTTCC TAGGGTCACC 
22401 TCTTGGTTGC AGCCACTGTG ACCCCAACTG GTCTAACCTC TCTCTTCCCC 
22451 TCCCACTTCC TTCCTGTGGT TCCTGCAGGA GATGTTCGGG AGCATGTTTC 
22501 ACGCAGAGAC CCTGACCGCC CTGTGAGGGC TCAGCCAGTC CTCATGCTGC 
22551 CTACAGAGTG CCTGGCACTT GGGACTTCCA TAAAGGATGA GCCTGGGGTC 
22601 ACAGGGGGTG TCGGGCGGAG GAAAGTGCAT CCCCCAGAGC TTGGGTTCCT 
22651 CTCTCCTCTC CCCCTCTCTC CTCCCTTCCT TCCCTCCCCG CATCTCCAGA 
22701 GAGAGCCTCT CAGCAGCAGG GGGGTGCTAC CCTTACAGGA GTGAGAGTCT 
22751 GGTGAGCCCA CTCTTCACCC GTCAGGCCCT GGCCGCAATG GACAAGCCTC 
22801 CTGCTCACTC CACCCCACCC ACCTCTGCCC TGTCCTTGGC AGCTGAAGGA
22851 CACCTTGACT TCCAGCTTTT ACGAGTGAGC CAAAAACAGA AGGACAAGTA
22901 CAACTGTGCT GGCCTGCTGT ACAAGCTTCA AAAAGTGTCC CAGAGCCCAC
22951 ACGGCTCGGT GTCAGATGGT GTCAGGCTGT CACGGACATA GGGATAAACT
23001 TGGTTAGGAC TCTGGCTTGC CTTCCCCAGC TGCCTCAACT CTGTCTCTGG 
23051 CAGCTCTGCA CCCAGGGACC ATGTGCTCTC CACACCCAGG AGTCTAGGCC 
23101 TTGGTAACTA TGCGCCCCCC GTCCATCATC CCCAAGGCTG CCCAAACCAC
23151 CACTGCTGTC AGCAAGCACA TCAGACTCTA GCCTGGACAG TGGCCAGGAC 
23201 CGTCGAGACC ACCAGAGCTA CCTCCCCGGG GACAGCCCAC TAAGGTTCTG
23251 CCTCAGCCTC CTGAAACATC ACTGCCCTCA GAGGCTGCTC CCTTCCCCTG 
23301 GAGGCTGGCT AGAAACCCCA AAGAGGGGGA TGGGTAGCTG GCAGAATCAT
23351 CTGGCATCCT AGTAATAGAT ACCAGTTATT CTGCACAAAA CTTTTGGGAA 
23401 TTCCTCTTTG CACCCAGAGA CTCAGAGGGG AAGAGGGTGC TAGTACCAAC
23451 ACAGGGAAAA CGGATGGGAC CTGGGCCCAG ACAGTCCCCC TTGACCCCAG 
23501 GGCCCATCAG GGAAATGCCT CCCTTTGGTA AATCTGCCTT ATCCTTCTTT 
23551 ACCTGGCAAA GAGCCAATCA TGTTAACTCT TCCTTATCAG CCTGTGGCCC 
23601 AGAGACACAA TGGGGTCCTT CTGTAGGCAA AGGTGGAAGT CCTCCAGGGA 
23651 TCCGCTACAT CCCCTAACTG CATGCAGATG TGGAAAGGGG CTGATCCAGA 
23701 TTGGGTCTTC CTGCACAGGA AGACTCTTTA ACACCCTTAG GACCTCAGGC
23751 CATCTTCTCC TATGAAGATG AAAATAGGGG TTAAGTTTTC CATATGTACA 
23801 AGGAGGTATT GAGAGGAACC CTACTGTTGA CTTGAAAATA AATAGGTTCC
23851 ATGTGTAAGT GTTTTGTAAA ATTTCAGTGG AAATGCACAG AAAATCTTCT
23901 GGCCTCTCAT CACTGCTTTT CTCAAGCTTC TTCAGCTTAA CAACCCCTTC
23951 CCTAACAGGT TGGGCTGGCC CAGCCTAGGA AAACATCCCC ATTTCTAACT
24001 TCAGCCAGAC CTGCGTTGTG TGTCTGTGTG TTGAGTGAGC TGGTCAGCTA
24051 ACAAGTCTTC TTAGAGTTAA AGGAGGGGGT GCTGGCCAAG AGCCAACACA
24101 TTCTTGGCCC AGGAGCATTG C II M CTGTG AATTCATTAT GCCATCTGGC
24151 TGCCAATGGA ACTCAAAACT TGGAAGGCGA AGGACAATGT TATCTGGGAT
24201 TCACCGTGCA CAGCACCCGA AGTGCCAAAT TCCAGGAGGA CAAGAGCCTT
24251 AGCCAATGAC AACTCACTCT CCCCTACTCC ACCTCCTTCC AAGTCCAGCT
24301 CAGGCCCAGG AGGTGGGAGA AGGTCACAGA GCCTCAGGAA TTTCCAAGTC
24351 AGAGTCCCCT TTGAACCAAG TATCTAGATC CCCTGAGGAC TTGATGAAGT
24401 GATCCTTAAC CCCCAAGTAA TCATTAACCC CCAGACCAGC CTCAGAACTG
24451 AAGGAGATTG TTGACCCAGT GACCTGGAGT TGAGGCTCAG GGAGAGATCT
24501 GCCACATGTC TGAGGGTTGC AGAGCC (SEQ ID NO: 3)
48 CTGGGTTCCTATGTGGGGAGGTCATGCTCCCCACTCATTGAGCCCCC
[C,G]

CAGGCAAACCACCTGGACAGCCAGACCCATGCAGACTCTGGAGCAGGTGGAGAGGAAGAG 
TGAGACCACCCCGCCTCACGGGCGGTGAAGGGCCGGCAGCCTCTGAATAGTCTCTGCTAG 
GAGGTAGAAAGCACCCTCCCATCTTAATCATAGTAATCATCGCCACTACCATTTACTGGG 
TGCCTATAAAAGGCCAGCCTCTTCATACACATGATCTCACTGAATCCTCATAGCATCTGC
CTGCGACTGTTATTATCCCCATTTACAGATGAAGAAACTGAATCTTTGAACCCAGGTCAT

132 CTGGGTTCCTATGTGGGGAGGTCATGCTCCCCACTCATTGAGCCCCCCCAGGCAAACCAC 
CTGGACAGCCAGACCCATGCAGACTCTGGAGCAGGTGGAGAGGAAGAGTGAGACCACCCC 
GCCTCACGGGC 
[G,A] 

GTGAAGGGCCGGCAGCCTCTGAATAGTCTCTGCTAGGAGGTAGAAAGCACCCTGCCATCT
TAATCATAGTAATCATCGCCACTACCATTTACTGGGTGCCTATAAAAGGCCAGCCTCTTC 
ATACACATGATCTCACTGAATCCTCATAGCATCTGCCTGCGACTGTTATTATCCCCATTT
ACAGATGAAGAAACTGAATCTTTGAACCCAGGTCATCTGGCTCTCAAACTTGTGCTGTTT
TCCCTAAGCCACCCGGTCTCTCATTTCTCCCACTGAAATGTCTCACATGCCATTGCCCTT

724 ATTGCCCTTACTCATTTCTGCCCATGTCTCCrCCAAAACACCATITATCAA^ 

CAAGTATGTGTTGAGTACACACTAAGGGCCAGGCGAGGGGCTGGGCACAGGCGCTGGGGG 
TAGGTTCATTCTCCCACCTTCGCTTCTGCrGGGTATCACCrGTGGGGTCTTGCCGGGCA^ 
CCCACCCTCACCTGTAGTTCAAGTGGACCTTGGGATCCCAAGACCAAATGAATGGAATGC 
ACCAGCCCAGCCTTCACCAACTTGAGCACAATCTTATTCATAATAGAAACTCACATTTGC 
[A,C] 

TCACACTTTACATTTTACACAACCCCTTCTTATCCATTAACTCATTTGATCTTCACAACA
ACCCTGTGAGATATGTCTGTTACTCCCACTTTAGTGATACAGAATCTGAGGTTTGAAAAG
TAATGCTGACCATTCTGCCTCATTAATAAAAGCAGGATTAACCCAGGCTCCTGGACCCTT
CCACAAAAGGCATTAAGCAACCTGCTCCCCTCTGACAACCTCCCCTGTCACCCAGGCTCT
CCTCTGGGAAGTTGGGGGCATCTCTAGCCCCCAAGTAGTTACTCATTTTCAACCCCATCT

1558 TCAGCTCTGCCCATCTCAGCTCCTGGAACGTCAGCCAGGTTGCGCAAAAAGTGAGGAGGA
GAGGAGCGGCAGTACACAAGGGTGGGGGAAAGATTAGGCACAGGAAGCCGTGGGAGAGAG 
AGCCGGCAGGTGGACCATCCTGGTTTCCCCACACACACCATTGTCCCCCTGGGAAACCTG 
TTGGTGAAGTTCTAGATGTCTTATCCAAGAAGGGTCCTCTTGAGGTCATCTCAGCTATCC 
CCCTGCCTCTAGGCAAGCTGTTTTCTGTTTCTTCCAAGCTGACTGGCTGAATGGTAGGAG
[C,G] 

CTTTCTGCCAGGGAAACTAAGGTCTGGGAAGGGAGTATGGCTTGTGGGGACACCAGGGGT 
CAGGGGAGGGGAGGGTCCACCTGCTGAATCAAGTGGGGCCTCCTGCCCTCGTGATTCCCC 
TTTGCCTGGTGCTCAGTGGGGGTGATGGTGACGCCACAGGTGTGGAGTGCCAGCCACGTG 
CTGAGCGCCAAGCAAAACAGCCAGGGTGAGTCTATGCATCATCAGTGCCTGGGAAGGAAG
GCCACTGCGAGCAGGGAGTCTGACGGAAAAACTTGACAGAGGGAAGGGAGGCACCTTGCT 

1577 CTCCTGGAACGTCAGCCAGGTTGCGCAAAAAGTGAGGAGGAGAGGAGCGGCAGTACACAA 
GGGTGGGGGAAAGATTAGGCACAGGAAGCCGTGGGAGAGAGAGCCGGCAGGTGGACCATC 
CTGGTTTCCCCACACACACCATTGTCCCCCTGGGAAACCTGTTGGTGAAGTTCTAGATGT 
CTTATCCAAGAAGGGTCCTCTTGAGGTCATCTCAGCTATCCCCCTGCCTCTAGGCAAGCT
GTTTTCTGTTTCTTCCAAGCTGACTGGCTGAATGGTAGGAGCCTTTCTGCCAGGGAAACT
[A,G] 

AGGTCTGGGAAGGGAGTATGGCTTGTGGGGACACCAGGGGTCAGGGGAGGGGAGGGTCCA 
CCTGCTGAATCAAGTGGGGCCTCCTGCCCTCGTGATTCCCCTTTGCCTGGTGCTCAGTGG
GGGTGATGGTGACGCCACAGGTGTGGAGTGCCAGCCACGTGCTGAGCGCCAAGCAAAACA
GCCAGGGTGAGTCTATGCATCATCAGTGCCTGGGAAGGAAGGCCACTGCGAGCAGGGAGT
CTGACGGAAAAACTTGACAGAGGGAAGGGAGGCACCTTGCTTTATCGGGGCGGGGAAGGC

2487 ACACGGCTTCTGCACTGGTATCCCTAAGATGGGGTTAAGGGAAGCCCTGGGGAAGTGAGG 
TTCTGAATGATGAATTTAAGATCCTACAACCTCATCTGTACTGAGACCCCCAGGGAGGAT 
GGGGAGCAGGAGCAAGAACCATCCAGAAGGGTTATATGGCATTCCCAAACCCCTGCATGG 
CATCTCCCATATTCTCAATTCACCCGGGTCTCTCTGGGTTTGTTAAGGCATGGTAGATGA
GCATCTACGTTATGGAGGGGTGGGGAGCATCAGAGCCCTTACTCCATGCCCTGTTCCCTC 
[C,A] 

TTACAAAAAATACCTGAAGTTACCATCACCCCAGGTTCTTTGTCCTTTCCCTCCCGGATG
TTCCTTCCTCCACTTGGTCCAGAGAATGCCAAAAGGAGGCCCTAAATTTCTGAACTTTCC 
TGAGGGGACCTACCAGGGTGTAGTCCTACCAGCGCCCAGGGTCTTTCCACTCTCATCTCC 
CTGGAAATGCGATGGTGGGTATGAAACCTTGTCCCTAAGTAGGCGCTACACAAGGTGATC 
CATACCCACACCCCAGGAGGCTGGGGCTGCGGGTGTCACCCTCCCCATTCCCAGACTCCT 

2634 AGGGTTATATGGCATTCCCAAACCCCTGCATGGCATCTCCCATATTCTCAATTCACCCGG 
GTCTCTCTGGGTTTGTTAAGGCATGGTAGATGAGCATCTACGTTATGGAGGGGTGGGGAG
CATCAGAGCCCTTACTCCATGCCCTGTTCCCTCCTTACAAAAAATACCTGAAGTTACCAT 
CACCCCAGGTTCTTTGTCCTTTCCCTCCCGGATGTTCCTTCCTCCACTTGGTCCAGAGAA 
TGCCAAAAGGAGGCCCTAAATTTCTGAACTTTCCTGAGGGGACCTACCAGGGTGTAGTCC
[T,C]

ACCAGCGCCCAGGGTCTTTCCACTCTCATCTCCCTGGAAATGCGATGGTGGGTATGAAAC 
CTTGTCCCTAAGTAGGCGCTACACAAGGTGATCCATACCCACACCCCAGGAGGCTGGGGC 
TGCGGGTGTCACCCTCCCCATTCCCAGACTCCTGGCAGACCTCCTCTGGCCCAGCTATAG 
GCCAACTCACTCTCCCTCACTCCCTTGGGGAAACGGCTGATTCAGTTACCTGGATTGAGG 
TCACTGGCAATGGCTGAAGTGGAGACGCAGGTGGAACTGGTTCAGGCCGGGGGAATCACC 

4352 ATTGGAGTTACCACACATAAAGGATAGTGAGTCAGCAGAGTGCACCCTGCAGGAACAATA 
GAGCCTTCCTTTTCAAGGAAGTTCTAAGAAAAATGGCAGCAGGCAGGCCCCACTCGGGTG
TATTCACTCATTCATTTATTCAACAAATATTTACTAAGTGCCCCTGTGCAAGGCTCGAGG
TGTACAAAGATGAACAGGAGAGCTAGACTTCTTGCCATGCGTGGTGGGGTTTGCTGCCTA
GTGGGAGAGACAGACAAAAAGCAAGGAATGCACACACAGGATGCACACACAGCGGCAGGA
[A,G]

CCAAGGTGCAGTTACCCAGGCCTGGGATCAGACAGACAGGACTCAGAGGAGACTTTCCCA
GAGAAAAGCCATCTGAGCCAAGGGATGGATCTGATACCTCCGAAGGCTGAGCCACCATAA
CACTCATACCTTTAAGCCAAGTCTTATAAACTCCCCAGGTAAGCAGCTGGCAGTC^
ACCTCCAGCTAATGCCCAGGACAAGTTGATGAGCTCTCAAGAAAAAGTTCCTGCLI NIC
TTCTCAATATCCCTGGCACACAGTTCAGTGAATTTTGAATGAACCAATGAATGAAATGAG

ATCCAGGTCCCACAAGGTGAAGGGGCTCCTTCAGCCAGGCCTGGATTGCCACTCCCCTCA 
CCATTCCTCTCCTCATCCCCACTCCATCCCTCTGTGATCCCCATAAGCTAGTCATGCTGC 
TGAGCTTCAGTCTCGTTGTCCTCTGCAGGCATGGCATTTGCTCTGCTGGCCAACCTTCCT
GCAGTCAATGGCCTCTACTCCTCCTTCTTCCCCCTCCTGACCTACTTCTTCCTGGGGGGT
GTTCACCAGATGGTGCCAGGTAAGGCCTCTCCCCTCTGGGCAGGCAGGATGACCCAGACC 
[A,C] 

CAAGGATGGGAGGTGTGGCAAAGGGGCCTCGGGAGATTTTCCATCTGCATTCTCCTGGAG 
TTGTTCCTGGTCAGTCCTAGGGGAATGGTCACTGTGAATGTCATTTCCAGGTCCTCGGTG 
ACCTTGGAGAAACCACTGAGCCTCTTTGAGTTCAGTTAGCATTACCTGTTCCATCTTCCT
CCTAGGAATGAGAGGAAGACTTAGCAGAACAAGATATACCATATGCTATAACATGCTTAA 
ACAGATGTGAGAAATCACCATCTAACTCCCTGGTTGGTCCCAGCCGGCCACTACAGGGAC 

TTAGCAGAACAAGATATACCATATGCTATAACATGCTTAAACAGATGTGAGAAATCACCA 
TCTAACTCCCTGGTTGGTCCCAGCCGGCCACTACAGGGACATTTGGAOTCTCTGGTGCT 
AAGTGAGATGGAGGAAAGCCTGGTCACAAGGGCTGGTTTCTGGTTCAGGCTCTGCTTATA 
TTTCTTATTTCTGAGTTCATTTTCTCACGTGTCCTGTATGACAATATTGACCATTGGGGT
AAAAGCACCTTGAAAAGCATAGATCATGGTTAGAGTGAGTGGTTGTTATTATTGTGTTGG 
[A,T] 

GAAGAGCCTTGGAGGTGCAGGGATCCATCCCCCTGGGGTCGGGAAGCATTCCTGGGCCCC 
TTTCTGGTTTCCATCGGTGTGGTTCAAACCTCTGATTTTTGCTGGCTGGGTGGGGCACCA
CAGGTACCTTTGCCGTTATCAGCATCCTGGTGGGTAACATCTGTCTGCAGCTGGCCCCAG 
AGTCGAAATTCCAGGTCTTCAACAATGCCACCAATGAGAGCTATGTGGACACAGCAGCCA 
TGGAGGCTGAGAGGCTGCACGTGTCAGCTACGCTAGCCTGCCTCACTGCCATCATCCAGG 

ATTATTGTGTTGGAGAAGAGCCTTGGAGGTGCAGGGATCCATCCCCCTGGGGTCGGGAAG 
CATTCCTGGGCCCCTTTCTGGTTTCCATCGGTGTGGTTCAAACCTCTGATTTTTGCTGGC
TGGGTGGGGCACCACAGGTACCTTTGCCGTTATCAGCATCCTGGTGGGTAACATCTGTCT 
GCAGCTGGCCCCAGAGTCGAAATTCCAGGTCTTCAACAATGCCACCAATGAGAGCTATGT 
GGACACAGCAGCCATGGAGGCTGAGAGGCTGCACGTGTCAGCTACGCTAGCCTGCCTCAC 
[T,C] 

GCCATCATCCAGGTGAGGGGGCAGCCCCCAACCCTGCTAGAAGGGCATCAGACCACCCTG 
CCCCTCCCTCAAAGCCTTAGCTTTGATGCTAAATCTGATTTAGGGGGCTGGGTGTGGAGG
CTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGAGGGTGGATCACTTGAGGTCAGG 
AGTTTGAGACCACCTTGACCAACGTGATGAAACCCCATCTCTACCAAAAATACAAAAATA 
ATCCAGGCTTGGTAGTATGCGCCTGTAGTCCCACCTACTCAGGAGGCTGAGGCAGGAGAA 

GCTAGAAGGGCATCAGACCACCCTGCCCCTCCCTCAAAGCCTTAGCTTTGATGCTAAATC 
TGATTTAGGGGGCTGGGTGTGGAGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCTGA 
GGAGGGTGGATCACTTGAGGTCAGGAGTTTGAGACCACCTTGACCAACGTGATGAAACCC 
CATCTCTACCAAAAATACAAAAATAATCCAGGCTTGGTAGTATGCGCCTGTAGTCCCACC 
TACTCAGGAGGCTGAGGCAGGAGAATCACTTGAATCCGGGAGGCAGAGGTTGCAGTGAGC 
[C,T] 

GAGATCGCGCCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCGTCTCAAAAAAAAA 
AAAAAAAAAAAAAAAAAAACCCAAGTTAGGGCTCACCTCCTCCCTCCTCCCCATCCCAGG 
GCTAAAGTGAACCTTGAAAATTAACAGTATCTCCTCATCTGCATGTAGCAGGACCATACA 
AAAAAACAACAGCTGTACCTGGTTAAACTGTCCTGAGCTTTAAACCTGTAAAAGACTCAC
AGCCTCTCTCCATTATCCCGTGGAGAAACCCAACTCTCTGCCAGCATAGTCTTGCAGACT

ATGAAACCCCATCTCTACCAAAAATACAAAAATAATCCAGGCTTGGTAGTATGCGCCTGT 
AGTCCCACCTACTCAGGAGGCTGAGGCAGGAGAATCACTTGAATCCGGGAGGCAGAGGTT 
GCAGTGAGCTGAGATCGCGCCACTGCACTCCAGCCTGGGTGACAGAGCGAGACTCCGTCT 
CAAAAAAAAAAAAAAAAAAAAAAAAAAAACCCAAGTTAGGGCTCACCTCCTCCCTCCTCC 
CCATCCCAGGGCTAAAGTGAACCTTGAAAATTAACAGTATCTCCTCATCTGCATGTAGCA 
[G,C]

GACCATACAAAAAAACAACAGCTGTACCTGGTTAAACTGTCCTGAGCTTTAAACCTGTAA 
AAGACTCACAGCCTCTCTCCATTATCCCGTGGAGAAACCCAACTCTCTGCCAGCATAGTC 
TTGCAGACTGCTAATTTTCTCTAACATCCCTCACTCCGCTCCAGCCTCCTCTGCTCCAAG 
CCACAGCAGCAGTTGCACAACATAAATTGAGCTTCTGCAAATGGTTGCAAAGGATTCTGC 
TAGGTTTTATGAAGGGAAGCACAACATGACAGAATGCAAGAGCAAAACACAGTCCCAGAG 

GTGACAGAGCGAGACTCCGTCTCAAAAAAAAAAAAAAAAAAAAAAAAAAAACCCAAGTTA 
GGGCTCACCTCCTCCCTCCTCCCCATCCCAGGGCTAAAGTGAACCTTGAAAATTAACAGT
ATCTCCTCATCTGCATGTAGCAGGACCATACAAAAAAACAACAGCTGTACCTGGTTAAAC
TGTCCTGAGCTTTAAACCTGTAAAAGACTCACAGCCTCTCTCCATTATCCCGTGGAGAAA
CCCAACTCTCTGCCAGCATAGTCTTGCAGACTGCTAATTTTCTCTAACATCCCTCACTCC
[T,G]

CTCCAGCCTCCTCTGCTCCAAGCCACAGCAGCAGTTGCACAACATAAATTGAGCTTCTGC 
AAATGGTTGCAAAGGATTCTGCTAGGTTTTATGAAGGGAAGCACAACATGACAGAATGCA
AGAGCAAAACACAGTCCCAGAGAGCGCCTTTTCATTCACTCATTCATTCGGTTTTGTGCC
AAGAACTAGGCTAAACCCTGGGATACAAAGATAAGTAAGAAAGAGGTCCAATTCACAAGT 
TGCTCACAGCCCAGCAGAGGAAGGAGCCATGTCAACAGATAAATTTGTATGCAGTGAGAT 

7247 GACACAGAGCAGAGTCACGGAGGACCTCAAAGAGGAGGTGACACTCCACCTCTCTTAAAG 
GATGAGAACTTAACCAGGAACAAGGTATACAGAGGATGGTCCAGGCAGAAGGGAACAGTG 
CCTAAAAACACTGAGGCCTGAGAGAGTGTGATCTGCGCAGGCAAAGTAAGGGGCTTGGTG 
TGGCTGGAGGGTAGAGGGCCCAGAAGAGGATGGAAAAGTAGGCAGGAGCCAGACAATGAG 
ATCTGGGGTCTGTTCTCTGACAGCGACTTTGGGTCTGATTGGCAGTTTATAAGGATCGTT 
[T,C]

GGGCTACACAATGATGAGTGGGAGGTGGATTAGAATCAAGGCAGGGGACCTGTTGGGAGA 
CTCTGCAGAGGCCCAGGCAGGAATAATGCAGGCGAAGACCAGGTAGAGAAAGAGATGGGG 
CTGGACTTGAAAAGAATGTTTTACCAGGAGCTTGGTGATAGACTGGATGTGGGAGGTAAG
GGAGGATGACTCTCAAGTTTTTGGTTGGGCAACCAGGTTAATGATGGTGTCATTTACTGA
GAGAGAAAACACTGGGGGAGGACTAGACTTATTTTACAGATAAGCCAAAGCCAGAGAGGT 

7360 AACAGTGCCTAAAAACACTGAGGCCTGAGAGAGTGTGATCTGCGCAGGCAAAGTAAGGGG 
CTTGGTGTGGCTGGAGGGTAGAGGGCCCAGAAGAGGATGGAAAAGTAGGCAGGAGCCAGA 
CAATGAGATCTGGGGTCTGTTCTCTGACAGCGACTTTGGGTCTGATTGGCAGTTTATAAG
GATCGTTTGGGCTACACAATGATGAGTGGGAGGTGGATTAGAATCAAGGCAGGGGACCTG 
TTGGGAGACTCTGCAGAGGCCCAGGCAGGAATAATGCAGGCGAAGACCAGGTAGAGAAAG 
[A,G] 

GATGGGGCTGGACTTGAAAAGAATGTTTTACCAGGAGCTTGGTGATAGACTGGATGTGGG
AGGTAAGGGAGGATGACTCTCAAGTTTTTGGTTGGGCAACCAGGTTAATGATGGTGTCAT
TTACTGAGAGAGAAAACACTGGGGGAGGACTAGACTTATTTTACAGATAAGCCAAAGCCA
GAGAGGTGATGTGACAGAAAGGCCCATGCTCTAAAGGAGCTGAAGGTCTGATGGCAGCCA 
TGTAGAGCACAGTGAAGGGCAGGTGAAGGTCACAGATGGTCCAATTCCCTCAAGCTACTG 

7644 GACCAGGTAGAGAAAGAGATGGGGCTGGACTTGAAAAGAATGTTTTACCAGGAGCTTGGT
GATAGACTGGATGTGGGAGGTAAGGGAGGATGACTCTCAAGTTTTTGGTTGGGCAACCAG
GTTAATGATGGTGTCATTTACTGAGAGAGAAAACACTGGGGGAGGACTAGACTTATTTTA
CAGATAAGCCAAAGCCAGAGAGGTGATGTGACAGAAAGGCCCATGCTCTAAAGGAGCTGA 
AGGTCTGATGGCAGCCATGTAGAGCACAGTGAAGGGCAGGTGAAGGTCACAGATGGTCCA 
[A,T] 

TTCCCTCAAGCTACTGCTACGCTAGGACTGCACGGAGCTCCAGACCTGCGTGTGTGTGGG 
GCGGGTCGTTGGAACTGCTGAACCACATTGGTCTTCCGCCACCAACCACCCTTTTCCTCC
TCTCAGATGGGTCTGGGCTTCATGCAGTTTGGCTTTGTGGCCATCTACCTCTCCGAGTCC
TTCATCCGGGGCTTCATGACGGCCGCCGGCCTGCAGATCCTGATTTCGGTGCTCAAGTAC
ATCTTCGGACTGACCATCCCCTCCTACACAGGCCCAGGGTCCATCGTCTTTGTGAGTCTG

8127 CATCCGGGGCTTCATGACGGCCGCCGGCCTGCAGATCCTGATTTCGGTGCTCAAGTACAT 
CTTCGGACTGACCATCCCCTCCTACACAGGCCCAGGGTCCATCGTCTTTGTGAGTCTGGG
GATGCACCCCTGCCATTGGAGCAAGGCTCCAGCAGACACATGAGGAGGATGTACTGTTTT 
AAGATGTCGTGAGCTCCTCATTGCAAGGGCTGGCTTAGCTGTTGTTCAGAGAGGATTCTG 
AGGGGGTTTCTGTCTTGGGAGGGTCAAAGTCATGACTCACAGAGGTTCTTGGTAGTTAAT
[A,C] 

CCTGCAGAAAAGAGCTGTACATTCTCCGCCAGTTCCCCATTCTAGTGCCTCAACCCCTCC
CTGCCTGGAAAGTCCTGCCTTATGTCTAATCTCCATCCCTCCTCCTTCAGCCCAAACTCT
TCTAAAGAAAAAGAAAGCATTCCTTTTCTAGCACAAGTTCCCCATGTGCCTTTTGGGAAA
GGGCGGTGGGCGACGGGACAGGGTTCCTGATCAGGGTTTTAATTCTGTCTTGGTGTGCCT
CCATTAGCTTTGATGGCATCCCTTCCCTGGGTCAGACACCCAAAGGTGGGGTATTATGGG 

8317 GAGCTCCTCATTGCAAGGGCTGGCTTAGCTGTTGTTCAGAGAGGATTCTGAGGGGGTTTC
TGTCTTGGGAGGGTCAAAGTCATGACTCACAGAGGTTCTTGGTAGTTAATACCTGCAGAA
AAGAGCTGTACATTCTCCGCCAGTTCCCCATTCTAGTGCCTCAACCCCTCCCTGCCTGGA
AAGTCCTGCCTTATGTCTAATCTCCATCCCTCCTCCTTCAGCCCAAACTCTTCTAAAGAA
AAAGAAAGCATTCCTTTTCTAGCACAAGTTCCCCATGTGCCTTTTGGGAAAGGGCGGTGG
[G,A] 

CGACGGGACAGGGTTCCTGATCAGGGTTTTAATTCTGTCTTGGTGTGCCTCCATTAGCTT
TGATGGCATCCCTTCCCTGGGTCAGACACCCAAAGGTGGGGTATTATGGGAAGAAGGGGT
GGGAGCCTGTGAGCATGATGCTCTTTCCCCCAGACCTTCATTGACATTTGCAAAAACCTC
CCCCACACCAACATCGCCTCGCTCATCTTCGCTCTCATCAGCGGTGCCTTCCTGGTGCTG
GTGAAGGAGCTCAATGCTCGCTACATGCACAAGATTCGCTTCCCCATCCCTACAGAGATG

9079 TTCTACTGCTCTAATAATTCCCCCTAAGGAGGCAGGGGAGTGGGATTCAGGGTCCCCAGA 
GAAAAGGGAGACTTGAGAGAGACGCCTGCCCTGGCCCCACCTTAGGGCCAATCCCCATTC 
TCCACTCTGGGGTTTGCAGGTGGTGGTGGCAACAGCTATCTCCGGGGGCTGTAAGATGCC 
CAAAAAGTATCACATGCAGATCGTGGGAGAAATCCAACGCGGGTGAGTCCAGGTGGCCCA 
GAAGCCTGGCCCACCCGCACCTCATGCCCCACTAAGGCCTGAGCTCGGAGAGGGAGACAA 
[G,A] 

ATGAACTCTATGAAAGTGCAGTCGAAACTGTATGACACTGACCATGTATGAATTATTACT 
ATTACCGTTTCCTGAGAAGGGCCGCACAACCAGCCAATGTAGGCTATTTTATGAGAAATG 
AGTCTTAACTGCCACACTCCCCTTATAAATCTCATTCAACTGATGCTGTTAAACAAAGCC 
TCTCTGAACAGCCGCTTGCTGGCTCTTTGCCTTGCTCTAATGCATTGGTTCTTTGTCCAT
GTAGAAAGGGAACTATTAGGTTCAACCAGATTCATGAAGCATCCACTCTGTGCCAGGCAC

9537 AACTGATGCTGTTAAACAAAGCCTCTCTGAACAGCCGCTTGCTGGCTCTTTGCCTTGCTC
TAATGCATTGGTTCTTTGTCCATGTAGAAAGGGAACTATTAGGTTCAACCAGATTCATGA
AGCATCCACTCTGTGCCAGGCACCATGCTGGGCCCTGGGAGGAGAGGGGTGACGCTTGTC 
CTGCAGGGTTGGAACAGGCAAGGGAGGGAAGACCACATAGCACCAAAGGTCTAGGGGTCT 
GTGGACTCGTGAGCATACAGGGTTCAGAATCTGGGAGTTAACAAACGAGGCCCTACCACA 
[G,T]

ACTGGCCCGGGGACCTTGGGCAAGTTAGGTTCrCrCAGCCrCAGTTTCarCCrr^ 

ACAGGAGTGATGGTCCCTACCCTATGGGGTGGTGCTGAGGATTCAGACTGGATGGGATAA 

CTTAGGCAAAGATCCCGGCACACCATGGGGGCCTGGCTGGTCCCTGTGGGCTGGTGAAGG

AClTGGCTGCCCrCCCCACTCACACCCTTGGGTTCTGCCTCCrTCCT^ 

GTTCCCCACCCCGGTGTCGCCTGTGGTCTCACAGTGGAAGGACATGATAGGCACAGCCTT 

12302 AGCCCCACCATAACCTATGGGAGAGGATTTACTAAACTTTCTTAACGGTGATGAAACCAA
GGCTCAGAATGGTTAAGTAAATTGTCAAAGGCCACAGAGGTAGGGAGTGGTAGAGTCTGG
ATTAAAACTCCAAGTCCTGGACTCCAGACCTCTAGGCTGTACTGTCTCATAGGGAAGGCA
GTCTCACCCACCTAGGGCAGAGAAGAAAATCCTTAAAGCCAGAGAAGTGAGTGGCTCATC 
TGTGGTCACCCAGAGAGACAGTGATGAGGACAGGGAGAAAAATTATACCTCAGTTCCCAG 
[C,G] 

CCAAGGATCTGCTTTGACCATAACCCAACAAGCCCCCGCTATGGTGGTATTTCCTTAGGT
TCATATGGCGGCTTTTGTTTCCATTTGATCTTCACAGCAATTCTCTACAGGAATCTGGGC
AGATTTATTTCCTTTAGAGGAATTTCCAGGTCTTAAAATCTATAGGGGGCAACTATCAAA
ACTTCACCCAATGTTGCCCCCTACCCACACACAAAACCAGGCCCCCAGCCGATCAGAAAG 
CACTGCTGAGCTCCTGTCAGGGCCCACGCAGCTCGCTGTGAGACAGAGAGAGGGAACTCA 

12354 GAAACCAAGGCTCAGAATGGTTAAGTAAATTGTCAAAGGCCACAGAGGTAGGGAGTGGTA 
GAGTCTGGATTAAAACTCCAAGTCCTGGACTCCAGACCTCTAGGCTGTACTGTCTCATAG 
GGAAGGCAGTCTCACCCACCTAGGGCAGAGAAGAAAATCCTTAAAGCCAGAGAAGTGAGT 
GGCTCATCTGTGGTCACCCAGAGAGACAGTGATGAGGACAGGGAGAAAAATTATACCTCA 
GTTCCCAGCCCAAGGATCTGCTTTGACCATAACCCAACAAGCCCCCGCTATGGTGGTATT
[C,T]

CCTTAGGTTCATATGGCGGCTTTTGTTTCCATTTGATCTTCACAGCAATTCTCTACAGGA
ATCTGGGCAGATTTATTTCCTTTAGAGGAATTTCCAGGTCTTAAAATCTATAGGGGGCAA
CTATCAAAACTTCACCCAATGTTGCCCCCTACCCACACACAAAACCAGGCCCCCAGCCGA 
TCAGAAAGCACTGCTGAGCTCCTGTCAGGGCCCACGCAGCTCGCTGTGAGACAGAGAGAG 
GGAACTCACATTTATTGATCACCTACTGAGCATCCATCACTAGGCTAGGACCGTCACATT

12487 ACCCACCTAGGGCAGAGAAGAAAATCCTTAAAGCCAGAGAAGTGAGTGGCTCATCTGTGG 
TCACCCAGAGAGACAGTGATGAGGACAGGGAGAAAAATTATACCTCAGTTCCCAGCCCAA 
GGATCTGCTTTGACCATAACCCAACAAGCCCCCGCTATGGTGGTATTTCCTTAGGTTCAT
ATGGCGGCTTTTGTTTCCATTTGATCTTCACAGCAATTCTCTACAGGAATCTGGGCAGAT
TTATTTCCTTTAGAGGAATTTCCAGGTCTTAAAATCTATAGGGGGCAACTATCAAAACTT
[C,T]

ACCCAATGTTGCCCCCTACCCACACACAAAACCAGGCCCCCAGCCGATCAGAAAGCACTG 
CTGAGCTCCTGTCAGGGCCCACGCAGCTCGCTGTGAGACAGAGAGAGGGAACTCACATTT 
ATTGATCACCTACTGAGCATCCATCACTAGGCTAGGACCGTCACATTCCTTAACTTTTGA
ATCCTTTCATGAGGTAGGCATTATTATTCTCCTTTTGTTTCACATAGCCATTAAAGAACA
AAATTTGGGGCTGGGTGTGCTGACTCACACCTGTGATCTAGCACTTTAGGGGGCTGAGGC 

13198 CTAACTATTCAGGAAGGTTAGGCGGGAGCACAACTTGGGTTCCAGGGTTTGAGGCTC^^
TGAGCTGATCTTGCCACTGCACTACAGCCTGAGCAACAGAGCAAGACCCTGTGACTC^
AAACAAACAAACAAACACATTTTGAACCCAAACAGATCTGACCCAAGATGCATGCTCTTA 
TAGATGCCACCTCCCTGTGTGCTGGGGCTTCTACTAAAAACACAGACAAGATCAGGCAAC 
CACAGTCAATCTAAGGGAAAGAGGAAAGTGTAACCAAAGCACAAATACATAAAATATTGC 
[-,A]

AAAATGCTATTTAAAGAAAAAAAAGAGAAGAGAGGCTCTGAGGTTGTACTAACAGAGAAT
GGCCTTGGCTAATCCAGGAAGACTTCCTGAAAGAGGTTGTTTTTTCCCCAGGTCTGCTTT
TGACATCTCTCTTTTCACAGTGCATCTGGGTAGTGAGCTTCCTCTCCTCCTTCTTCCTCA
GCCTGCCCTATGGTGTGGCAGTGGGTGTCGCCTTCTCCGTCCTGGTCGTGGTCTTCCAGA
CTCAGTTGTAAGTGATAGCTTCCGCCCTCCTAGGCCCACAGTCGGTTCCCTGGGCCAGCC

13257 GTGAGCTGATCTTGCCACTGCACTACAGCCTGAGCAACAGAGCAAGACCCTGT^
AAAACAAACAAACAAACACATTTTGAACCCAAACAGATCTGACCCAAGATGCATGCTCTT
ATAGATGCCACCTCCCTGTGTGCTGGGGCTTCTACTAAAAACACAGACAAGATCAGGCAA 
CCACAGTCAATCTAAGGGAAAGAGGAAAGTGTAACCAAAGCACAAATACATAAAATATTG 
CAAAAATGCTATTTAAAGAAAAAAAAGAGAAGAGAGGCTCTGAGGTTGTACTAACAGAGA 

[A,G] 

TGGCCTTGGCTAATCCAGGAAGACTTCCTGAAAGAGGTTGTTTTTTCCCCAGGTCTGCTT
TTGACATCTCTCTTTTCACAGTGCATCTGGGTAGTGAGCTTCCTCTCCTCCTTCTTCCTC
AGCCTGCCCTATGGTGTGGCAGTGGGTGTCGCCTTCTCCGTCCTGGTCGTGGTCTTCCAG 
ACTCAGTTGTAAGTGATAGCTTCCGCCCTCCTAGGCCCACAGTCGGTTCCCTGGGCCAGC 
CCGCAAAGGGCTTCCATGCCACGGCCTGGCTTAGTCCACTGTACCTTCCACCTCTGGGCC 

14541 TCATGGACACTGACATTTATGTGAATCCCAAGACCTATAATAGGGTAGGTAATTCAAGCT
TATGACCTCCTTCTTTTGCTCTGCACCACCCCAAGAAGAGGTTGCTTTTTAAAGCCAATA
AAGACATTTCTGCAACTTGAGCTCAGTCTCCCTGTCACAGGCCCAGGATATCCAGGGGAT 
TAAAATCATCACGTACTGCTCCCCTCTCTACTTTGCCAACTCAGAGATCTTCAGGCAAAA 
GGTCATCGCCAAGGTAAGGCTCAGTCCCTGGCGACCAGAGGCTCTGGACAGAGAGTGGCC 
[G,A] 

GAAAATGGAAGCAGAAGGGCGGTGGGAGCTGAGAATAGGCCACTCCCATAGAGGGTGGAG 
GTCAAGATTGCTGTTGGCTCTCTCCCTGCAGACAGGCATGGACCCCCAGAAAGTATTACT 
AGCCAAGCAAAAATACCTCAAGAAGCAGGAGAAGCGGAGAATGAGGCCCACACAACAGAG 
GAGGTCTCTATTCATGAAAACCAAGGTGAATGAAGGCCAGAAGCAGCCCCGTGCCCTGCT 
CTCCTGCCCATTCTGATACTGCCCCCTGTTACTCATGGTACCCTGGGGGCCCCGCTTCCC 

14545 GGACACTGACATTTATGTGAATCCCAAGACCTATAATAGGGTAGGTAATTCAAGCTTATG
ACCTCCTTCTTTTGCTCTGCACCACCCCAAGAAGAGGTTGCTTTTTAAAGCCAATAAAGA
CATTTCTGCAACTTGAGCTCAGTCTCCCTGTCACAGGCCCAGGATATCCAGGGGATTAAA
ATCATCACGTACTGCTCCCCTCTCTACTTTGCCAACTCAGAGATCTTCAGGCAAAAGGTC
ATCGCCAAGGTAAGGCTCAGTCCCTGGCGACCAGAGGCTCTGGACAGAGAGTGGCCGGAA
[A,G]

ATGGAAGCAGAAGGGCGGTGGGAGCTGAGAATAGGCCACTCCCATAGAGGGTGGAGGTCA 
AGATTGCTGTTGGCTCTCTCCCTGCAGACAGGCATGGACCCCCAGAAAGTATTACTAGCC 
AAGCAAAAATACCTCAAGAAGCAGGAGAAGCGGAGAATGAGGCCCACACAACAGAGGAGG 
TCTCTATTCATGAAAACCAAGGTGAATGAAGGCCAGAAGCAGCCCCGTGCCCTGCTCTCC 
TGCCCATTCTGATACTGCCCCCTGTTACTCATGGTACCCTGGGGGCCCCGCTTCCCACCC 

15041 ACCAAGGTGAATGAAGGCCAGAAGCAGCCCCGTGCCCTGCTCTCCTGCCCATTCTGATAC 
TGCCCCCTGTTACTCATGGTACCCTGGGGGCCCCGCTTCCCACCCTGACAGGCAAAGACA 
GAAAGTCTCTGGGAACACTGCCTGGTGGCCGCTGGGCATTTTTCTTCTTTTTTTTCTTTT
TCTTTTTAGAGATGGAATTTTGCTCTTGTCACCCAGGCTTGAGTGCAATGGCGTTATCTT
GGCTCACTGCAACCTCCACCTCTGGGGTTCAAGCGATTCTCCTGCCTTAGCCTCCCAAGT 

[C,A] 

GCTGAGATTACAGGTGCCACCACACCCAGCTAATTTTTGTATTTTTAGTAGATATTGGGT
TTCACCATGTTGGCCAGGCTGGTGTCAAACTCCTGACCTCAGGTGATCCACCTACCTTAG 
CCTTCCAAAGTGCTGGGATTACAAGCCTGAGCCACTGCGCCCAGCCTGGGCATTTTTCTT
CTTGGATGAGGTGCTACCATCTCCCAGGGAAGCCACTGAACCCCCAAGGCCCTTCTCCAT 
TTTCTGGCTAAGATAGGACATGGCCCATGGACTTTTGAACAACCCAGAGGGGGAACAGCA 

15053 GAAGGCCAGAAGCAGCCCCGTGCCCTGCTCTCCTGCCCATTCTGATACTGCCCCCTGTTA 
CTCATGGTACCCTGGGGGCCCCGCTTCCCACCCTGACAGGCAAAGACAGAAAGTCTCTGG 
GAACACTGCCTGGTGGCCGCTGGGCATTTTTCTTCTTTTTTTTCTTTTTCTTTTTAGAGA
TGGAATTTTGCTCTTGTCACCCAGGCTTGAGTGCAATGGCGTTATCTTGGCTCACTGCAA
CCTCCACCTCTGGGGTTCAAGCGATTCTCCTGCCTTAGCCTCCCAAGTCGCTGAGATTAC
[A,C]

GGTGCCACCACACCCAGCTAATTTTTGTATTTTTAGTAGATATTGGGTTTCACCATGTTG
GCCAGGCTGGTGTCAAACTCCTGACCTCAGGTGATCCACCTACCTTAGCCTTCCAAAGTG 
CTGGGATTACAAGCCTGAGCCACTGCGCCCAGCCTGGGCATTTTTCTTCTTGGATGAGGT
GCTACCATCTCCCAGGGAAGCCACTGAACCCCCAAGGCCCTTCTCCATTTTCTGGCTAAG 
ATAGGACATGGCCCATGGACTTTTGAACAACCCAGAGGGGGAACAGCAGTGAATTTCCTG

15065 CAGCCCCGTGCCCTGCTCTCCTGCCCATTCTGATACTGCCCCCTGTTACTCATGGTACCC 
TGGGGGCCCCGCTTCCCACCCTGACAGGCAAAGACAGAAAGTCTCTGGGAACACTGCCTG
GTGGCCGCTGGGCATTTTTCTTCTTTTTTTTCTTTTTCTTTTTAGAGATGGAATTTTGCT
CTTGTCACCCAGGCTTGAGTGCAATGGCGTTATCTTGGCTCACTGCAACCTCCACCTCTG
GGGTTCAAGCGATTCTCCTGCCTTAGCCTCCCAAGTCGCTGAGATTACAGGTGCCACCAC
[A,G] 

CCCAGCTAATTTTTGTATTTTTAGTAGATATTGGGTTTCACCATGTTGGCCAGGCTGGTG
TCAAACTCCTGACCTCAGGTGATCCACCTACCTTAGCCTTCCAAAGTGCTGGGATTACAA
GCCTGAGCCACTGCGCCCAGCCTGGGCATTTTTCTTCTTGGATGAGGTGCTACCATCTCC
CAGGGAAGCCACTGAACCCCCAAGGCCCTTCTCCATTTTCTGGCTAAGATAGGACATGGC
CCATGGACTTTTGAACAACCCAGAGGGGGAACAGCAGTGAATTTCCTGGGGAACCCAGGC

15108 TGTTACTCATGGTACCCTGGGGGCCCCGCTTCCCACCCTGACAGGCAAAGACAGAAAGTC
TCTGGGAACACTGCCTGGTGGCCGCTGGGCATTTTTCTTCTTTTTTTTCTTTTTCTTTTT
AGAGATGGAATTTTGCTCTTGTCACCCAGGCTTGAGTGCAATGGCGTTATCTTGGCTCAC
TGCAACCTCCACCTCTGGGGTTCAAGCGATTCTCCTGCCTTAGCCTCCCAAGTCGCTGAG
ATTACAGGTGCCACCACACCCAGCTAATTTTTGTATTTTTAGTAGATATTGGGTTTCACC
[A,C] 

TGTTGGCCAGGCTGGTGTCAAACTCCTGACCTCAGGTGATCCACCTACCTTAGCCTTCCA 
AAGTGCTGGGATTACAAGCCTGAGCCACTGCGCCCAGCCTGGGCATTTTTCTTCTTGGAT
GAGGTGCTACCATCTCCCAGGGAAGCCACTGAACCCCCAAGGCCCTTCTCCATTTTCTGG 
CTAAGATAGGACATGGCCCATGGACTTTTGAACAACCCAGAGGGGGAACAGCAGTGAATT 
TCCTGGGGAACCCAGGCAGCCCAGGGCTAGCAAGGCTGGGGTGGCCATGGCAGTAATCCT 

16274 CTTCCAGACTGTCTCCCTGCAGGAGCTGCAGCAGGACTTTGAGAATGCGCCCCCCACCGA
CCCCAACAACAACCAGACCCCGGCTAACGGCACCAGCGTGTCCTATATCACCTTCAGCCC 
TGACAGCTCCTCACCTGCCCAGAGTGAGCCACCAGCCTCCGCTGAGGCCCCCGGCGAGCC 
CAGTGACATGCTGGCCAGCGTCCCACCCTTCGTCACCTTCCACACCCTCATCCTGGACAT 
GAGTGGAGTCAGCTTCGTGGACTTGATGGGCATCAAGGCCCTGGCCAAGGTGAGGCCCTC 
[-,G]

GGGACAGCAAGCACCACCC^CTCCACCCCCTCCGCrCTGCTCrCCACATTCCCTTTCCT^ 
GGAGCCCTCATTTCAGGAAGCTGAGGGAGGAAGCTCACTGGGGAGACTAACAGCTCCTAG
GAATCCCTCCTTTCCCCAGACGCCACCAGGTTGAGACATTCTCCACAGAGCAGGCCCAGA
CGGCCCATGACAATGAGTGGCGGGACAAGTCTACCAGAGTTTCAGGCCCCTGTGCTCCCA 
ACACCCCCAGCAGTGGCCATCCCAAGTCCCTCTCAGCCATCAGGAACCCACCCAGGTTCT 

17424 AACATGGTGAAACCCCGCCTCTACTAAAAATACAAAAATTAGCCAGGTGTGGTGACGGGC 
CCCTGTAGTCCCAGCTACTCGGTAGGCTGAGGCAGAGAATTGCTTGAACCCAGGAGGCGG 
AGGTTGCAGTGAGCCAAGATCGCGCCACTGCACTCCAGCCTGGGCAACAGAGTGAAACTC 
CATCTCAAAAGAAAAAAAAAGAAAATATCTAGCCCCACAAGAAGGGGCCATGGTGACTTT 
AAGTGCCCGCCACGTTGGCAAAAGTCCATTTCCGCTCCACTTCCCAGAGAAACCGTCAGC 
[C,T] 

AACACTCCAGGGAGAAGTGGTGTGCTTTGCTGCTATTTTTGTCTTTGGCTGCTGGGCTCT
CAGGGTTGCTTATTTGTTTGGCTTCCCCTCTGAAGTACGTTTTGTGAATCACTTTTGAGA
CCCACTCAGAACATTCCTTTCCTTTTGCCTCCCTACCCCAACAACACTTCTAGCTGAGCT
CCACCTATGGGAAGATCGGCGTGAAGGTCTTCTTGGTGAACATCCATGGTAAGAGAAAGA
GGACATTTAGGGACTGAAAGACTGGCAAGGAGTGTGGGGTAGGAACAGGTTGGTGGGGTC

17627 AATATCTAGCCCCACAAGAAGGGGCCATGGTGACTTTAAGTGCCCGCCACGTTGGCAAAA 
GTCCATTTCCGCTCCACTTCCCAGAGAAACCGTCAGCCAACACTCCAGGGAGAAGTGGTG 
TGCTTTGCTGCTATTTTTGTCTTTGGCTGCTGGGCTCTCAGGGTTGCTTATTTGTTTGGC
TTCCCCTCTGAAGTACGTTTTGTGAATCACTTTTGAGACCCACTCAGAACATTCCTTTCC
TTTTGCCTCCCTACCCCAACAACACTTCTAGCTGAGCTCCACCTATGGGAAGATCGGCGT
[G,A]

AAGGTCTTCTTGGTGAACATCCATGGTAAGAGAAAGAGGACATTTAGGGACTGAAAGACT
GGCAAGGAGTGTGGGGTAGGAACAGGTTGGTGGGGTCTGAATAGTGAGGAGGTTGGAAAC 
GAGAGCACCCAGCTATCCCCCACAAGCTGCTGCCTGCTCATAAAAGCTTCAGGTACAAGT 
CCAAAGAGACTGGTCAGATTGCATAAACATCCTAGGGGCCTTAGTGACAGAGTGGGGGTG 
AGGAGGTCATGGAGTTACAGAAGGACAGCTAGGATTCTAATCTACCCCATAACTAATTTG 

18427 GGGTGCATATACACAGCCTCAAGGACGTGGCCACAGGGCAGCAGACATTTACATGACTAG
CATGTACGCAAAGTGCAGAGATGTGGGAGCAAGTGCACACAGACACACAGGAGAATGTGA 
AGGGGCACATACACACACACCCAGCTCCCTGCACTGGGTCAGACCCCCTCCAGCAGGGCT 
GCAGTTCCCAAGCTCCGCATGGCCACGTTCGGGGAGAGAATCTGCAGTGGCAATGACCTG 
CTATGATATGTTCTGGAGTTAGAAGCAGTGGATTCTCCCCAACCTCACTGGACACCCCCT 
[T,C]

AGGAAACCATCTCTAGGATTAAGAGTAATCCACACAAACTTCCAATGCCACACATTGGAA
GTTGCTGGAAAGGTCTGGGAAAACAAGAGGAAGGATGGGTCCTTGGGGGATAGAACTGGC
AGCGGCCTCTTCAAGGATGGCTTAGGCTTTTCCACTCGAATCACCACAAAGTACTGACTC
CCTAAATCAAACTGCTTCCTTCTGCTCTGGGTTGAAACTTCAGCATCCTCAAGTTCATGT
TGCCCTCTGCCGTCCAGAACTGATATTGCACTGCCAATGCCATGGCCCTCAGATACAGCA

AGAGGAAGGATGGGTCCTTGGGGGATAGAACTGGCAGCGGCCTCTTCAAGGATGGCTTAG
GCTTTTCCACTCGAATCACCACAAAGTACTGACTCCCTAAATCAAACTGCTTCCTTCTGC
TCTGGGTTGAAACTTCAGCATCCTCAAGTTCATGTTGCCCTCTGCCGTCCAGAACTGATA
TTGCACTGCCAATGCCATGGCCCTCAGATACAGCAAGAGCTGGGACCTCAGGCCTCTCCC
ATCCCTGCTCTGGTCTCACTATCTTCCCCACCCCCAGCTCCAATCCACAATGGCTGTTAT
[C,G] 

TTTCTGAAGGTGATCTTTTCTCCTTCTAGCCCAGGTGTACAATGACATTAGCCATGGAGG
CGTCTTTGAGGATGGGAGTCTAGAATGCAAGCACGTCTTTCCCAGCATACATGACGCAGT
CCTCTTTGCCCAGGCAAATGCTAGAGACGTGACCCCAGGACACAACTTCCAAGGGGTAAG
GTTCTTGCACCTGGGGAATCCTAGGCTCCAAGGCACTGAAATAGCAGGACCAAGAGGCAT
TATTAGAAAGAACACAGGAGAAGGTTTAAGTTCCAATATCAAGTCTGCCATTTCAGTTTT 

GGACCTCAGGCCTCTCCCATCCCTGCTCTGGTCTCACTATCTTCCCCACCCCCAGCTCCA
ATCCACAATGGCTGTTATCTTTCTGAAGGTGATCTTTTCTCCTTCTAGCCCAGGTGTACA
ATGACATTAGCCATGGAGGCGTCTTTGAGGATGGGAGTCTAGAATGCAAGCACGTCTTTC
CCAGCATACATGACGCAGTCCTCTTTGCCCAGGCAAATGCTAGAGACGTGACCCCAGGAC
ACAACTTCCAAGGGGTAAGGTTCTTGCACCTGGGGAATCCTAGGCTCCAAGGCACTGAAA

[T,C] 

AGCAGGACCAAGAGGCATTATTAGAAAGAACACAGGAGAAGGTTTAAGTTCCAATATCAA 
GTCTGCCATTTCAGTTTTCTGAATCTGTTTCCTTATCTATAGAATGAGCACCATCAACTA
ACATTACCTACCTCTCTGCATTTTTCTTTTATTTTGTTTTAGGGTTAAATGATAATTACA
TCTTTTGTGTCACTTGAAAGCACTTTGTGTATTGTAAAAATTCTTTATCAATATAAGTTT
TCTGGTTGCACAAACACCCAAAGCATAGTAGAGCAGGCCCACTCTGCTGGCATCGTTCCC 

AGGATGGGAGTCTAGAATGCAAGCACGTCTTTCCCAGCATACATGACGCAGTCCTCTTTG 
CCCAGGCAAATGCTAGAGACGTGACCCCAGGACACAACTTCCAAGGGGTAAGGTTCTTGC 
ACCTGGGGAATCCTAGGCTCCAAGGCACTGAAATAGCAGGACCAAGAGGCATTATTAGAA 
AGAACACAGGAGAAGGTTTAAGTTCCAATATCAAGTCTGCCATTTCAGTTTTCTGAATCT
GTTTCCTTATCTATAGAATGAGCACCATCAACTAACATTACCTACCTCTCTGCATTTTTC
[T,C]

TTTATTTTGTTTTAGGGTTAAATGATAATTACATCTTTTGTGTCACTTGAAAGCACTTTG
TGTATTGTAAAAATTCTTTATCAATATAAGTTTTCTGGTTGCACAAACACCCAAAGCATA 
GTAGAGCAGGCCCACTCTGCTGGCATCGTTCCCTGCCTCCTCCTCATCTCTTTCTAAAGG 
GGGCTTTCGGGAAGGGAGGGGAGGGGAGTAAGCCTACCCATTTTAACTTACCGGAGCTTA 
GAGATTTCAGGCTGGTGAGGGATAAAGAGATTGGGTCTGAGTTTTGTCTCAGCTTTTTGA

TAATTACATCTTTTGTGTCACTTGAAAGCACTTTGTGTATTGTAAAAATTCTTTATCAAT
ATAAGTTTTCTGGTTGCACAAACACCCAAAGCATAGTAGAGCAGGCCCACTCTGCTGGCA
TCGTTCCCTGCCTCCTCCTCATCTCTTTCTAAAGGGGGCTTTCGGGAAGGGAGGGGAGGG
GAGTAAGCCTACCCATTTTAACTTACCGGAGCTTAGAGATTTCAGGCTGGTGAGGGATAA
AGAGATTGGGTCTGAGTTTTGTCTCAGCTTTTTGACATTTAATTTACTAGCTCAGTAAGT
[-,G,C]

ATACAAATGGGATACAAATAACACCATCTAAAACTCCAGAAGACTGGGGAGTCAGAAAAA 
TCCTACCTCCTTGGGGTCCCTGCCCAGATCCCCAGTCATCTCTAGCCCTCAGGGTCCCCT 
CCCAGCTCAGCTCCTGCCCTTGGCCTCCCAAGACTCTTGTTGTGCCCCAGCCCTGGGTAA
AAACCTCCCCTGCCCTCTGTGGGTCATAAGAAAGGCTTTTCTGGCCCTAGAGCAATGATT
TGCTCTTTGCCTTAAGAGACTGATGAAGGTGAAACCATCTGTTCTAAGTGCTGAAAGACT

AGTTTTCTGGTTGCACAAACACCCAAAGCATAGTAGAGCAGGCCCACTCTGCTGGCATCG 
TTCCCTGCCTCCTCCTCATCTCTTTCTAAAGGGGGCTTTCGGGAAGGGAGGGGAGGGGAG 
TAAGCCTACCCATTTTAACTTACCGGAGCTTAGAGATTTCAGGCTGGTGAGGGATAAAGA 
GATTGGGTCTGAGTTTTGTCTCAGCTTTTTGACATTTAATTTACTAGCTCAGTAAGTCAT
ACAAATGGGATACAAATAACACCATCTAAAACTCCAGAAGACTGGGGAGTCAGAAAAATC 
[T,G,C] 

TACCTCCTTGGGGTCCCTGCCCAGATCCCCAGTCATCTCTAGCCCTCAGGGTCCCCTCCC 
AGCTCAGCTCCTGCCCTTGGCCTCCCAAGACTCTTGTTGTGCCCCAGCCCTGGGTAAAAA
CCTCCCCTGCCCTCTGTGGGTCATAAGAAAGGCTTTTCTGGCCCTAGAGCAATGATTTGC
TCTTTGCCTTAAGAGACTGATGAAGGTGAAACCATCTGTTCTAAGTGCTGAAAGACTGCC 
CAGGAACACACAGGGCGCTGGCTCCTGCCCTCCATGCCTAGAGGGAAACCCTGGGGAAAC 

GCCTAGAGGGAAACCCTGGGGAAACAACGGGCTTTCCTGCTTCGTGAAATTTGTCCGCAG 
AGCAAAGAGGGAGATTCTGGAGGAAGCTGCATTAGTTGTTAGTGCCCTAATCATGTTCAG
CTACTCTAGTTGGTATGTATACTTGATTAGTCATAGCACTTATAAATAATTTATATTTTA
TATAATATATACTTACATATTATAGACCATTCACAGATACAAATCACACACATAAACACA
CACCTTTTCAACAGCATTGTGAGGGACAAAGCAGGCAAAGTGAGGCTGGTTATCAGACTT
[T,G]

AACAGATTAGAAAATATATTCCCAGGAGGACAGGAATTCCCCAAGGTCAGGCAGCTAGCC 
AATAGTTTTTCTAAGCTGAGTAAAACCTTCCCTGCCTCTAACGGCCCACAAAGGAGGGAA
GACCGCGATACACACCTGTCTGGTATAAGGGGGAAGACCACAGCCGTGCTGTTTTTGTGA
GGCAGGTAAGGGAAGGGGCAAGAGGATAAGTCATGTGTCAGGAAGCAGCGTCCAACCAGA
GCCGGCCACCTGTCCCTTTTCCTGCCACCATGCACCAACTTTGCTGTTCAGTCACTGAAG

20180 TTCCTGCTTCGTGAAATTTGTCCGCAGAGCAAAGAGGGAGATTCTGGAGGAAGCTGCATT
AGTTGTTAGTGCCCTAATCATGTTCAGCTACTCTAGTTGGTATGTATACTTGATTAGTCA
TAGCACTTATAAATAATTTATATTTTATATAATATATACTTACATATTATAGACCATTCA
CAGATACAAATCACACACATAAACACACACCTTTTCAACAGCATTGTGAGGGACAAAGCA
GGCAAAGTGAGGCTGGTTATCAGACTTTAACAGATTAGAAAATATATTCCCAGGAGGACA
[G,A]

GAATTCCCCAAGGTCAGGCAGCTAGCCAATAGTTTTTCTAAGCTGAGTAAAACCTTCCCT
GCCTCTAACGGCCCACAAAGGAGGGAAGACCGCGATACACACCTGTCTGGTATAAGGGGG
AAGACCACAGCCGTGCTGTTTTTGTGAGGCAGGTAAGGGAAGGGGCAAGAGGATAAGTCA
TGTGTCAGGAAGCAGCGTCCAACCAGAGCCGGCCACCTGTCCCTTTTCCTGCCACCATGC
ACCAACTTTGCTGTTCAGTCACTGAAGCTCATTCTGCACTGGCTTCCTCCCTTCCAGGCT

20584 TGTCTGGTATAAGGGGGAAGACCACAGCCGTGCTGTTTTTGTGAGGCAGGTAAGGGAAGG
GGCAAGAGGATAAGTCATGTGTCAGGAAGCAGCGTCCAACCAGAGCCGGCCACCTGTCCC
TTTTCCTGCCACCATGCACCAACTTTGCTGTTCAGTCACTGAAGCTCATTCTGCACTGGC
TTCCTCCCTTCCAGGCTCCAGGGGATGCTGAGCTCTCCTTGTACGACTCAGAGGAGGACA
TTCGCAGCTACTGGGACTTAGAGCAGGTGAGCTGAGGGAAGGGGCTGTGAGGGTGGGAGC 
[A,T] 

GGGCGAAGAGGGGAAGGATGGGGTCGCTGTCAAATACAAGGCGTTCACTCAGCTGTCTCA 
CCTCCAGCCCAGAGCAGTCACATTCAAGGCCACAAAGATTTGTGGTCATCTTTGTTTTTT
TTCTTTTCCTTTTCTTTTTTTTTTTTTTTTAATTTGAGACAAAGTCTCACTCTATCACCC
AGACTGGAATGCAGTGGCATGATCTCAGCTCACTGCAACCTCTGCCTCCCGGGTTCCAGA
GGTTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTTCAGGCCTGCGCCCAGCTAATTTT

20717 ATGCACCAACTTTGCTGTTCAGTCACTGAAGCTCATTCTGCACTGGCTTCCTCCCTTCCA
GGCTCCAGGGGATGCTGAGCTCTCCTTGTACGACTCAGAGGAGGACATTCGCAGCTACTG
GGACTTAGAGCAGGTGAGCTGAGGGAAGGGGCTGTGAGGGTGGGAGCAGGGCGAAGAGGG
GAAGGATGGGGTCGCTGTCAAATACAAGGCGTTCACTCAGCTGTCTCACCTCCAGCCCAG
AGCAGTCACATTCAAGGCCACAAAGATTTGTGGTCATCTTTGTTTTTTTTCTTTTCCTTT
[T,C] 

CTTTTTTTTTTTTTTTTAATTTGAGACAAAGTCTCACTCTATCACCCAGACTGGAATGCA
GTGGCATGATCTCAGCTCACTGCAACCTCTGCCTCCCGGGTTCCAGAGGTTCTCCTGCCT
CAGCCTCCCGAGTAGCTGGGACTTCAGGCCTGCGCCCAGCTAATTTTTGTATTTTTAGTA
GAGACAGCTTTTCACCATGTTGGCTGGGCTGGTCTCGAACTTCCGATCTCAAGCAATCTG
CCTGCCTCGGTCTCCTAAGTGCCTGGATTACAGGCATAAGCCACGATGCCTGGCCTTTGT

20894 GGGGAAGGATGGGGTCGCTGTCAAATACAAGGCGTTCACTCAGCTGTCTCACCTCCAGCC 
CAGAGCAGTCACATTCAAGGCCACAAAGATTTGTGGTCATCTTTGTTTTTTTTCTTTTCC
TTTTCTTTTTTTTTTTTTTTTAATTTGAGACAAAGTCTCACTCTATCACCCAGACTGGAA
TGCAGTGGCATGATCTCAGCTCACTGCAACCTCTGCCTCCCGGGTTCCAGAGGTTCTCCT
GCCTCAGCCTCCCGAGTAGCTGGGACTTCAGGCCTGCGCCCAGCTAATTTTTGTATTTTT
[A,G] 

GTAGAGACAGCTTTTCACCATGTTGGCTGGGCTGGTCTCGAACTTCCGATCTCAAGCAAT
CTGCCTGCCTCGGTCTCCTAAGTGCCTGGATTACAGGCATAAGCCACGATGCCTGGCCTT
TGTTTTCATTCTTCTCACTCCCTGAAAGGCATCGTGGGGAGAGGGTGAGTCACTGGACCA
AGTCCTAGAGAACCAGTATCTATTCTTATTCTCCAACACATCACCCACGTGACCCTGAGC
AAGCCACATACACCCTGGGCCCTAGTTTTTATCATCTGTGAAATTAGGGGAAACATAGGT

21787 GGGTGCAATGGTTCACACCTGTAATCCCAGCACTTTGGGAGGCTGAGGTGGGCGGACCAC 
CTGAGGTCAGGAGTTTGAGACCAGCCTGGCCAACATGGCGAAACCCCGTCTCTACTAAAA 
GCACAAAAATTAGCCAGGCGTAGTGGTGCATGCCTGTAGTCCCAGCTACTCGGAAGTCTG
AGGCATGAGAATCACTTGAACCTGGGAGGCAGATGTTGCAGTGAGCCGAGATCGTGCCAC
TGCACTCCAGCTTGGGTGACAGAGCTAGACTGTCTOWWVCAAACAAACAAAACAAAAC
[-,A,C] 

TAAAAGATATGTGGATATGAGGGATCACCATCCCCATAGGGCCCCTGGATTAACACCACC 
CCACC^TGCCCTGAATTAAAAGAAACCAGATGACTAGGTTTGGAGAAATCTGGCTTTGG 
GTCTATGAGAAGTAGTGTCTCTCTTTGTGCCTCTTCCCATTCTTTTTGACATTGAGCTCC
ATGGTGCTCTGAATCCGTCTCTCACAGTGCTGATGGCAGGTGGGACAGATTAGAAAATAG 
AGCTGGAGCCACAGAGATTTGGCAGACTGATTTCGGTGCCCTCTTGGAATCTCCAGCACA 

22264 CTCCATGGTGCTCTGAATCCGTCTCTCACAGTGCTGATGGCAGGTGGGACAGATTAGAAA 
ATAGAGCTGGAGCCACAGAGATTTGGCAGACTGATTTCGGTGCCCTCTTGGAATCTCCAG
CACATTCCAAAAAGCCTGGATAGGACCAAAATAGCTTATCAACGTGAGAAAGGACTTCAG 
AGCTTGTCTACTGCCAACCCTCATTTTACCCAATGAGGAAAGTGAAGCTATTAGGGGGCG 
AGGGACACGTGGAAGGTCACACAGCACACAGGAGGTGATTCACATGTAGATTTCAGCACC 
[T,C]

GCTCCTGCCACGCTGGACTGGTTCACCTCCTAGGCTGACCCTGCCTCTCCCCTGTTCACA 
CACACTCTCGCACACACACACACACACACACACACACACAGGTGCTTTGTTCTGGCCAGG
GGTTCCTAGGGTCACCTCTTGGTTGCAGCCACTGTGACCCCAACTGGTCTAACCTCTCTC
TTCCCCTCCCACTTCCTTCCTGTGGTTCCTGCAGGAGATGTTCGGGAGCATGTTTCACGC
AGAGACCCTGACCGCCCTGTGAGGGCTCAGCCAGTCCTCATGCTGCCTACAGAGTGCCTG 

22338 ACAGAGATTTGGCAGACTGATTTCGGTGCCCTCTTGGAATCTCCAGCACATTCCAAAAAG
CCTGGATAGGACCAAAATAGCTTATCAACGTGAGAAAGGACTTCAGAGCTTGTCTACTGC 
CAACCCTCATTTTACCCAATGAGGAAAGTGAAGCTATTAGGGGGCGAGGGACACGTGGAA 
GGTCACACAGCACACAGGAGGTGATTCACATGTAGATTTCAGCACCTGCTCCTGCCACGC 
TGGACTGGTTCACCTCCTAGGCTGACCCTGCCTCTCCCCTGTTCACACACACTCTCGCAC 
[-,C,A]

CACACACACACACACACACACACACAGGTGCTTTGTTCTGGCCAGGGGTTCCTAGGGTCA
CCTCTTGGTTGCAGCCACTGTGACCCCAACTGGTCTAACCTCTCTCTTCCCCTCCCACTT
CCTTCCTGTGGTTCCTGCAGGAGATGTTCGGGAGCATGTTTCACGCAGAGACCCTGACCG
CCCTGTGAGGGCTCAGCCAGTCCTCATGCTGCCTACAGAGTGCCTGGCACTTGGGACTTC
CATAAAGGATGAGCCTGGGGTCACAGGGGGTGTCGGGCGGAGGAAAGTGCATCCCCCAGA 

23363 CAGGGACCATGTGCTCTCCACACCCAGGAGTCTAGGCCTTGGTAACTATGCGCCCCCCGT
CCATCATCCCCAAGGCTGCCCAAACCACCACTGCTGTCAGCAAGCACATCAGACTCTAGC 
CTGGACAGTGGCCAGGACCGTCGAGACCACCAGAGCTACCTCCCCGGGGACAGCCCACTA 
AGGTTCTGCCTCAGCCTCCTGAAACATCACTGCCCTCAGAGGCTGCTCCCTTCCCCTGGA 
GGCTGGCTAGAAACCCCAAAGAGGGGGATGGGTAGCTGGCAGAATCATCTGGCATCCTAG 
[T,C]

AATAGATACCAGTTATTCTGCACAAAACTTTTGGGAATTCCTCTTTGCACCCAGAGACTC
AGAGGGGAAGAGGGTGCTAGTACCAACACAGGGAAAACGGATGGGACCTGGGCCCAGACA 
GTCCCCCTTGACCCCAGGGCCCATCAGGGAAATGCCTCCCTTTGGTAAATCTGCCTTATC 
CTTCTTTACCTGGCAAAGAGCCAATCATGTTAACTCTTCCTTATCAGCCTGTGGCCCAGA
GACACAATGGGGTCCTTCTGTAGGCAAAGGTGGAAGTCCTCCAGGGATCCGCTACATCCC 

23688 AAACTTTTGGGAATTCCTCTTTGCACCCAGAGACTCAGAGGGGAAGAGGGTGCTAGTACC
AACACAGGGAAAACGGATGGGACCTGGGCCCAGACAGTCCCCCTTGACCCCAGGGCCCAT
CAGGGAAATGCCTCCCTTTGGTAAATCTGCCTTATCCTTCTTTACCTGGCAAAGAGCCAA
TCATGTTAACTCTTCCTTATCAGCCTGTGGCCCAGAGACACAATGGGGTCCTTCTGTAGG
CAAAGGTGGAAGTCCTCCAGGGATCCGCTACATCCCCTAACTGCATGCAGATGTGGAAAG 
[G,A] 

GGCTGATCCAGATTGGGTCTTCCTGCACAGGAAGACTCTTTAACACCCTTAGGACCTCAG 
GCCATCTTCTCCTATGAAGATGAAAATAGGGGTTAAGTTTTCCATATGTACAAGGAGGTA
TTGAGAGGAACCCTACTGTTGACTTGAAAATAAATAGGTTCCATGTGTAAGTGTTTTGTA 
AAATTTCAGTGGAAATGCACAGAAAATCTTCTGGCCTCTCATCACTGCTTTTCTCAAGCT
TCTTCAGCTTAACAACCCCTTCCCTAACAGGTTGGGCTGGCCCAGCCTAGGAAAACATCC 

24210 TCACTGCTTTTCTCAAGCTTCTTCAGCTTAACAACCCCTTCCCTAACAGGTTGGGCTGGC
CCAGCCTAGGAAAACATCCCCATTTCTAACTTCAGCCAGACCTGCGTTGTGTGTCTGTGT
GTTGAGTGAGCTGGTCAGCTAACAAGTCTTCTTAGAGTTAAAGGAGGGGGTGCTGGCCAA
GAGCCAACACATTCTTGGCCCAGGAGCATTGCTTTTCTGTGAATTCATTATGCCATCTGG
CTGCCAATGGAACTCAAAACTTGGAAGGCGAAGGACAATGTTATCTGGGATTCACCGTGC 
[A,C] 

CAGCACCCGAAGTGCCAAATTCCAGGAGGACAAGAGCCTTAGCCAATGACAACTCACTCT 
CCCCTACTCCACCTCCTTCCAAGTCCAGCTCAGGCCCAGGAGGTGGGAGAAGGTCACAGA
GCCTCAGGAATTTCCAAGTCAGAGTCCCCTTTGAACCAAGTATCTAGATCCCCTGAGGAC 
TTGATGAAGTGATCCTTAACCCCCAAGTAATCATTAACCCCCAGACCAGCCTCAGAACTG 
AAGGAGATTGTTGACCCAGTGACCTGGAGTTGAGGCTCAGGGAGAGATCTGCCACATGTC 